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Abstractions 


FIRST AUTHOR 
In the birth defect 
phocomelia, the long 
bones of the limbs are 
shorter than normal or, 
in the worst cases, do not 
develop at all, meaning 
that the hand or foot is 
attached directly to the shoulder or hip. 
Scientists studying the disorder — also seen 
in the 1950s and 1960s in the children of 
women who were prescribed the sedative 
thalidomide while pregnant — can mimic its 
characteristics by exposing the limb buds of 
developing chick embryos to irradiation. 
Researchers had thought that the effects 
of both X-irradiation and thalidomide 
resulted in a defect in patterning — 
the correct laying-out of cells during 
development. In this model, a cell's identity 
is determined by the length of time it spends 
in a region near the tip of the developing 
limb bud. The thinking was that X-rays 
or thalidomide forced cells to stay too 
long in this region, called the progress 
zone, blocking their ability to form the 
bone cells of the limb regions closest to 
the shoulder and hip. But postdoc Jenna 
Galloway at Harvard Medical School in 
Boston, Massachusetts, and her colleagues 
have discovered that X-irradiation leads to 
problems with differentiation, the process by 
which progenitor cells assume specialized 
characteristics and functions (see page 
A400). She tells Nature more. 


How does irradiation affect limb-bud cells? 
During normal development the limb bud 
grows out and skeletal progenitor cells 
differentiate into bone cells. After we 
irradiated chick embryos, their limb buds 
lost some of the progenitor cells that become 
bone cells in the upper and middle parts of a 
limb, or proximal segments. These proximal 
segments still form, based on the presence 
of segment-specific, or patterning, markers. 
But when we looked for a differentiation 
marker for skeletal progenitors, we could not 
find it in the proximal segments. The marker 
was left only in the distal segments, the cells 
that would develop into hands or feet. 


Were any results unexpected? 

We were surprised that a major signal for 
limb growth and for telling cells to become 
distal cells was not disrupted by X-rays. This 
signal remained intact, which is why that part 
of the limb develops even after irradiation. 


Did you encounter challenges? 

When | started irradiating chick embryos, | 
put whole eggs into the X-ray machine. The 
embryos were fine, but did not get irradiated 
because of the shell. Then | tried cutting out 
asection of eggshell, leaving the embryo 
exposed, but it died. Finally, | cut a hole in the 
egg and placed shell pieces over part of the 
embryo to shield it, and that worked. | 


MAKING THE PAPER 
Susan Alberts and Greg Wray 


Baboon susceptibility to a parasite 
reveals parallels with humans. 


During the past four decades, US and Kenyan 
scientists have been monitoring a popula- 
tion of yellow baboons (Papio cynocephalus) 
in the Amboseli National Park in Kenya. The 
animals’ detailed life histories are now being 
combined with genetic analyses to gain a 
glimpse into their evolutionary past. 

As a first step, Susan Alberts and Greg Wray 
of Duke University in Durham, North Caro- 
lina, and their colleagues have identified a 
variation in a baboon gene that is associated 
with susceptibility to a malaria-like pathogen 
— the first report ofa link between a gene and 
a complex trait in a wild population of non- 
human primates (see page 388). The fact that 
a similar association exists in humans suggests 
that both species endured comparable pressures 
from pathogens during their evolution. 

“Like early humans, baboons are savannah 
foragers that adapted to a wide range of habitats 
and opportunities. They can be put in almost 
any environment and will thrive,’ says Alberts. 
“We think they share a parallel evolutionary 
history with humans.” 

Because of such parallels, the Amboseli 
Baboon Research Project was established in 
1971 by Jeanne and Stuart Altmann of Prince- 
ton University, New Jersey, to study a popu- 
lation of about 300 baboons organized into 
5 social groups, or extended families. “Liter- 
ally every day someone is observing them,” says 
Alberts, who co-directs the project with Jeanne 
Altmann. 

In addition to careful monitoring of baboon 
behaviour, such as eating and sleeping habits, 
researchers have gathered physical measure- 
ments, as well as hormone and DNA samples 
from the animals’ faeces. A few years ago, 
Alberts and Wray, a geneticist, started discuss- 
ing the idea of applying genetic methods to gain 
further insight into these animals. 

Both researchers credit first author Jenny 
Tung with getting that idea off the ground. She 
wanted to find connections between variations 
in certain genes, or genotypes, and particular 
traits (phenotypes) in the Amboseli baboons. 

As a first phenotype, they chose to examine 
the susceptibility of baboons to infection with 
the blood parasite Hepatocystis kochi, which is 
related to the Plasmodium parasite that causes 
human malaria. Human susceptibility to patho- 
gens is a trait that is often linked to single genes, 
and is thus amenable to genetic analyses. The 
main challenge for doing this type of study in 
wild animals is obtaining enough high-quality 
DNA and RNA samples for analysis. 

Tung and Alberts spent two months during 


A 


Susan Alberts (left) and Greg Wray. 


three consecutive summers living in tents at the 
Amboseli research site, collecting blood samples 
from baboons to extract DNA and RNA. Work- 
ing with Kenyan researchers, they first had to 
put the animals to sleep by darting them, while 
being careful to minimize disruption to the 
group. “We would only dart animals who were 
alone, away from the rest of their social group, 
and no more than two animals each day and no 
more than six each week,” says Alberts. “As a 
result the animals never lost habituation to us.’ 

Back in the lab at Duke, the group ana- 
lysed DNA samples from 190 baboons for the 
presence of Hepatocystis DNA. They discov- 
ered that infection rates varied substantially 
between individuals. They then looked at the 
DNA sequences of several candidate genes to 
identify changes that were associated with the 
observed variations in Hepatocystis susceptibil- 
ity. In the end, they found that a change of one 
nucleotide in a gene called FY confers protec- 
tion from the parasite. A different variation in 
the human FY gene is known to confer protec- 
tion against malaria. 

The fact that a genotype is associated with 
similar traits in baboons and humans shows 
that “given the same evolutionary pressures, 
the same type of thing can happen in parallel 
in different organisms’, says Wray. “One lesson 
from this is the precision with which natural 
selection acts.’ Another important conclusion, 
notes Wray, “is that field studies such as this one 
can be done despite all the challenges”. 

Alberts and Wray next plan to tackle an even 
more ambitious project: finding genotypes asso- 
ciated with behavioural traits. In a 2003 paper 
in Science on baboon social integration, Alberts 
and her colleagues examined the observation 
that, although female baboons spend their 
whole lives in the same social group, some are 
well integrated — having lots of other baboons 
to groom with, for example — whereas others 
are more isolated. “We found that the degree of 
social integration predicts how well the offspring 
will do,’ says Alberts. “So there is an evolution- 
ary pressure on females to be social.” 

The team will now look for the genetic deter- 
minants of social isolation. “It is a risky project. 
But this is the kind of population you need to do 
itin,’ says Wray. “And if we cant do it here, then 
it probably can't be done” a 


Visit Nautilus for regular news relevant to Nature authors } http://blogs.nature.com/nautilus and see 
Peer-to-Peer for news for peer reviewers and about peer review } http://blogs.nature.com/peer-to-peer. 
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Net gains 


The Moon landing was not the only world-changing event in the summer of '69. 


314 reveals just how powerfully the Apollo programme moti- 
vated young people to become scientists 40 years ago — a fact 
today’s space scientists ignore at their peril (see pages 325 and 327). 

Yet other events in the summer of 1969 would lead to a far deeper 
empowerment of scientists — and, indeed, many others. Even as 
Apollo 11 was putting the first humans on the Moon, Ken Thomp- 
son at AT&T’s Bell Labs was working to get Space Travel, a computer 
game hed written for a mainframe computer, to run on a new, smaller 
machine. That effort led him to join with Dennis Ritchie and others to 
write a new computer operating system, which they named Unix. The 
rest is history: Unix triggered a still-ongoing boom in scientific com- 
puting, set the pattern for the open-source software movement and, 
along with its descendants, laid the foundations for the Internet. 

Thompson wrote the first version of Unix in four weeks. It was ini- 
tially for internal use only. But when the code was licensed for use 
outside of Bell Labs in the 1970s, it was quickly adopted by scientists 
worldwide. They embraced Unix for its power, simplicity and its abil- 
ity to let machines interact with multiple users at once. No longer did 
they have to run stacks of punch cards through huge mainframes and 
wait hours or days to get their results on reams of fan-fold paper. Now 
they could just type and the machine would respond. 

Timing was also key to the success of Unix. It appeared just as the 
mainframes were being challenged by a new generation of smaller, 
cheaper, interactive ‘minicomputers’ suited to individual depart- 
ments and research groups. The new operating system could easily 
be adapted to run on any of these machines. Perhaps most impor- 
tantly, researchers loved the fact that Unix was written in C, anew 


\ n international, cross-disciplinary survey by Nature on page 


programming language that made it easy for them to write and share 
applications — and even to refine the operating system itself. They 
lost no time in doing so, in a flurry of innovation that presaged the 
open-source movement. 

In the 1980s and 1990s, that openness fell victim to shortsighted 
commercial interests, when infighting among vendors of proprietary 
Unix machines produced incompatible versions of the operating sys- 
tem. This allowed Unix to be blindsided by an upstart called Micro- 
soft, which quickly acquired a dominant position in the burgeoning 
microcomputer market with its own operating systems. 

Nonetheless, Unix’s future had been guaranteed in the late 1970s 
by the US Defense Advanced Research Projects Agency, which chose 
a version of the operating system developed at the University of Cali- 
fornia, Berkeley, as the basis for a number of its projects. And by fund- 
ing Berkeley to include an implementation of the then-new TCP/IP 
protocols that underlay the Internet, the agency essentially made Unix 
Internet-ready. At the same time, the open-source movement became 
central to the Internet's development, with the Unix clone Linux, writ- 
ten in 1991 by Finnish student Linus Torvalds, coming to dominate. 

Today, Linux or some other flavour of Unix runs most of the serv- 
ers, routers and other elements of Internet infrastructure, as well 
as most mobile phones and GPS devices. And last week Google 
announced that it will use Linux as the base for its planned open- 
source ‘Chrome’ operating system. 

The difference between the development of Unix and the top- 
down, colossal science and engineering project that was Apollo could 
hardly have been greater. Yet both are examples of the power of joint 
efforts. Unix, like Apollo, has earned its place in history. a 


Nowhere to hide 


The G8 has laid down a marker by promising to 
restrict the rise of global temperatures. 


(G8) industrialized nations pledged to try to keep the planet 

from warming by more than 2°C above pre-industrial tem- 
peratures — an ambitious goal that has also been adopted by other 
countries including China, Brazil and South Africa (see page 313). 

The absence of any commitment to reduce emissions before 2050 
is less promising but — assuming that the G8 can be taken at its word 
— it shouldn't bea fatal defect. If warming is to stay within those two 
degrees, the global ‘decarbonization’ project must be tackled without 
delay anyway. 

This will not be cheap or easy. When the two-degree ceiling is 
translated into a per-capita emissions limit, it is clear that the industrial- 
ized world, and particularly the United States, has already generated 
more than its fair share of greenhouse gases. By continuing to emit at 


\ t their annual summit last week, leaders of the Group of Eight 


these levels, these countries are taking up emissions allocations that 
developing nations need to use to grow. This creates a ‘carbon debt’ 
that must be repaid with technology and money, both of which will 
be necessary if poorer countries are to leapfrog the dirty develop- 
ment pioneered by the industrialized nations. 

The only viable strategy is therefore a massive research-and- 
development drive for energy efficiency in the near term, coupled 
with a long-term move towards virtually carbon-free energy pro- 
duction by 2040 at the latest. Cap-and-trade regulations can play a 
part, but they are only the beginning. Smart urban planning, clean 
transport systems and lifestyle changes will also be important. 

History teaches that great advances — such as agriculture, sanitation 
and mechanization — arrive in bursts. There is hope that, despite an 
agonizingly slow start, the energy revolution of the twenty-first century 
can still gain the required momentum. Avoiding long-term investment 
in high-emission technology will be more important than throwing 
around potentially unachievable emission-reduction numbers. 

The G8 has set a point of reference for the UN climate-change confer- 
ence in December. Global leaders must now put together a framework 
that allows — and requires — all nations to do their part. rT] 
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RESEARCH AIGHLIGHTS 


Smothered by aswarm 
Naturwissenschaften doi:10.1007/s00114-009- 
0575-0 (2009) 


Honeybees don't just kill hornets with heat — 
they gas them, too. The giant hornet (Vespa 
mandarinia japonica) is a ferocious predator of 
the Japanese honeybee (Apis cerana japonica). 
The bees fight back by smothering the invader 
inside a ball formed by their swarming bodies, 
killing the hornet in minutes. It was thought that 
the temperature inside the ball — which can rise 
to around 46 °C — killed the hornet while sparing 
the more heat-tolerant bees. 

Michio Sugahara and Fumio Sakamoto of Kyoto 
Gakuen University, Japan, found that outside 
a bee ball, hornets survive such temperatures. 
Their heat tolerance falls, however, as carbon 
dioxide levels rise. The air in a bee ball contains 
about 3.7% CO,, and in this atmosphere the 
temperature is lethal to the hornet. 


PALAEOCLIMATOLOGY 
Tropical ice 


Geophys. Res. Lett. doi:10.1029/2009GL037643 
(2009) 

Ice cores can reveal historical information 
about climate and vegetation through 
trapped aerosols such as dust, soot and 
complex molecules. 

Matthew Makou of Ohio State University 
in Columbus and a team of fellow 
ice enthusiasts looked at 
compounds derived from 
tropical plants that were 
blown to high altitudes, 
where they became 
trapped in ice. The 
researchers tried 
out relatively new 
methods — stir-bar 
sorptive extraction and 
thermal desorption 
— that allowed 
useful information to 
be extracted from such 
compounds without the need 
for many kilograms of ice. 

The authors suggest that the methods 
will be useful in reconstructing past 
characteristics of tropical areas, such as forest- 
fire occurrence, vegetation cover and aridity. 


NANOTECHNOLOGY 
Penned in protein 


Angew. Chem. Int. Ed. doi:10.1002/anie.200900950 
(2009) 

Etching precise protein patterns on a 
substrate is important both for medical 
diagnostics and for fundamental studies. 
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Until now, this has usually been done by first 
creating protein-attracting regions and then 

filling in the rest of the surface with protein- 

repellent molecules. 

Michael Zharnikov of the University of 
Heidelberg in Germany and his coworkers 
have now developed a technique that ‘writes’ 
protein patterns on toa substrate more easily. 
They do this by depositing an ethylene glycol 
layer ona gold surface and then zapping it 
with electron-beam lithography, 
creating hydrophobic areas that 
attract proteins only where 
the electron beam was 
focused (pictured left; 

yellow dots represent 
adsorbed protein). If 
the adsorbed proteins 
carry specific binding 
sites, additional 
proteins can then be 
attached as a secondary 
patterning motif. The 
approach could lead to 
the creation of sophisticated 
‘biochips: 


CHEMICAL BIOLOGY 
A glowing report 


Nature Chem. Bio. doi:10.1038/nchembio.190 (2009) 
Michael Tsang and colleagues at the 
University of Pittsburgh, Pennsylvania, 
wanted to find small-molecule modulators 
of the fibroblast growth factor (FGF) 
pathway, which is central to embryonic 
development. They screened chemicals in 
zebrafish that had been engineered to glow 
when FGF was active. One chemical known 
as BCI supercharged the glow, indicating it 
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was boosting FGF activity. 

Further study revealed that BCI blocks 
the activity of dual-specificity phosphatase 6 
(Dusp6), an FGF feedback regulator, thereby 
stopping it from impairing FGF and resulting 
in the extra glow. Compared with untreated 
zebrafish embryos, BCI-treated embryos 
had more cardiac progenitor cells — which 
eventually give rise to the heart — but fewer 
of the cells that go on to line blood vessels 
and form blood cells. The results, say Tsang’s 
team, demonstrate the utility of zebrafish in 
chemical screens in vivo. 


EVOLUTION 
Nice guys finish last 


Behav. Ecol. Sociobiol. doi:10.1007/s00265-009- 
0814-6 (2009) 

Selfish individuals can profit from the 
altruism of others in their group, and can 
even exploit a group’s resources so much that 
the resources become exhausted — an event 
known as the tragedy of the commons. 

Omar Eldakar, of the University of Arizona 
in Tucson, and his colleagues have now 
shown experimentally that aggressive mating 
in water striders (Aquarius remigis) can result 
in one such tragedy. Harassment of female 
water striders by males has previously been 
shown to drive away females, diminishing the 
mating success of all males in a group. 

The team built pools and manipulated 
the number of aggressive and nonaggressive 
males in each. They found that 
hyperaggressive males had greater mating 
success than those which were not aggressive 
within mixed pools. But as the number of 
hyperaggressive insects increased, the mating 
success of both types decreased. 
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ORGANIC CHEMISTRY 


Cockroach cruncher 


Angew. Chem. Int. Ed. doi:10.1002/anie.200902192 
(2009) 
Powdered leaves from the Mexican 
cockroach plant, Haplophyton cimicidum, 
have been used to poison cockroaches and 
to repel fleas for hundreds of years. And 
attempts to synthesize one of the plant’s 
active ingredients — the indole alkaloid 
(+)-haplophytine — have frustrated organic 
chemists ever since its complex molecular 
structure was solved in 1973. 

Hidetoshi Tokuyama at Tohoku University 
in Sendai, Japan, and his colleagues 
now report the first total synthesis of 
(+)-haplophytine, creating 4.2 milligrams of 
the molecule. Ina key step, the researchers 
performed an internal rearrangement to 
create the left-hand segment of the molecule. 
They then used a traditional Fischer indole 
synthesis to join that, through a central bond, 
to the previously synthesized right-hand 
segment. 


IMMUNOLOGY 


Themis in the thymus 


Nature Immunol. doi:10.1038/ni.1768; doi:10.1038/ 
ni.1766; doi:10.1038/ni.1769 (2009) 
The maturation of pathogen-fighting T cells 
relies on a mysterious protein called Themis, 
three independent groups have found. 
Themis is expressed exclusively in T cells, 
but its protein sequence — although highly 
conserved in vertebrates — provided few 
clues to its function. All three teams generated 
mouse mutants that lack the full-length 
Themis protein, and found that the usual 
process by which properly functioning 
T cells are selected to continue maturation in 


the thymus was impaired. The result: fewer 
T cells survived this selection, and the mice 
contained smaller numbers of mature T cells. 
Paul Love of the National Institutes of 
Health in Bethesda, Maryland, and his 
colleagues found that this defect could be 
reversed by stimulating signalling through 
a protein complex called T-cell antigen 
receptor (TCR), which is crucial for the 
activation and development of T cells. 
Meanwhile, a second group led by Nicholas 
Gascoigne of the Scripps Research Institute 
in La Jolla, California, demonstrated that 
signalling through the TCR mediated by 
two other key molecules — calcium and a 
protein called Erk — is reduced when Themis 
is lacking. And a team headed by Richard 
Cornall of the University of Oxford, UK, 
and Ronald Schwartz, also of the National 
Institutes of Health, found that the expression 
of several genes, including those involved in 
cell survival, was reduced in these mutants — 


possibly contributing to eventual T-cell death. 


DEVELOPMENTAL BIOLOGY 


The turtle fold 


Science 325, 193-196 (2009) 

The bodies of turtles and other chelonians 
are fundamentally different from those of 
all other animals. Their distinctive shells 


RESEARCH HIGHLIGHTS 


develop from the ribcage, which lies above, 
rather than below, the shoulder blades. 
Shigeru Kuratani and his colleagues at the 
RIKEN Center for Developmental Biology 
in Kobe, Japan, compared the embryogenesis 
of the Chinese soft-shelled turtle (Pelodiscus 
sinensis, pictured below left) with that of other 
animals to understand how developmental 
changes may have resulted in the turtle’s 
unusual form. The authors found that 
whereas the ribs of mice and chickens grow 
to encircle the body, those of turtles grow 
outwards and then stop, with the shoulder 
blades tucking under the edges of the ribs. 


NEUROLOGY 


New neurons show the way 


Science 325, 210-213 (2009) 

Researchers have uncovered a function of 
new neurons that are generated in the brains 
of adult mice. 

The role of adult neurogenesis in a brain 
region called the hippocampus has been 
difficult to pin down because the effects 
seem to be subtle and it is technically tricky 
to selectively disrupt neuronal development 
in adult animals. Fred Gage of the Salk 
Institute for Biological Studies in La Jolla, 
California, Timothy Bussey at the University 
of Cambridge, UK, and their colleagues 
designed challenging tasks that required 
mice to distinguish between very similar 
surroundings. 

Using two techniques to knock down 
neurogenesis, the authors discovered 
that although the new neurons were not 
necessary for easy navigational tasks, they 
were important for more complex ones. 
Notably, for remembering very small 
differences in the spatial arrangement of 
their environment. 


JOURNAL CLUB 


Friedhelm von Blanckenburg 
GFZ German Research Centre 
for Geosciences, Potsdam, 
Germany 


A geochemist learns that 
mountain building does not 
accelerate rock weathering. 


Mountain building has been 
deemed essential for stabilizing 
Earth's climate over the scale of 
millions of years. As tectonic forces 
push mountains into the sky, they 
provide fresh rock surface that is 
degraded by the physical effects of 
rain and temperature change, and 


by chemical weathering as carbon 
dioxide is dissolved in rain to form 
carbonic acid. Atmospheric CO, is 
thus consumed to convert rock into 
soil, which happens fastest where 
erosion rates are highest, exposing 
new rock to be weathered. Rivers 
then transport this carbon to the 
oceans where it is disposed of as 
carbonate sediment. 

But many Earth scientists have 
questioned this story. Wouldn't 
periods of high tectonic activity, 
such as the rise of the Himalayas, 
provide enough rock to consume 
so much CO, that the planet would 
turn into an ice house? 

Yet the consequences of 


mountain building are perhaps 

less profound than expected. The 
amount of weathering over time 
can be accurately measured on 

hill slopes using new geochemical 
methods that combine solute loss 
from soils with radioactive isotopes 
formed by cosmic rays to determine 
how long it takes for rocks to break 
down into soil. Two recent papers 
modelled the implications of this 
approach numerically. Surprisingly, 
the prediction is that weathering 
decreases rather than increases at 
erosion rates typical of high, active 
mountains (K. L. Ferrier and 

J. W. Kirchner Earth Planet. Sci. Lett. 
272, 591-599; 2008; E. J. Gabet and 
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S. M. Mudd Geology 37, 151-154; 
2009). So hill-slope weathering in 
the Himalayas might do no more 
to withdraw CO, than any actively 
eroding, mid-altitude mountain 
range found worldwide. 

Perhaps geochemists have 
been looking in the wrong place. 
Does the CO,-consuming mineral 
decomposition thought to occur 
on high slopes actually happen on 
the floodplains below large, active 
mountains? We might need to 
take a closer look at these areas 
before we really understand the 
geological carbon cycle. 

Discuss this paper at http://blogs. 
nature.com/nature/journalclub 
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NEWS 


NIH nominee draws scrutiny 


Francis Collins is likely to face funding challenges — and criticism of his Christian evangelism. 


As Francis Collins prepares to take the helm 
of the US National Institutes of Health (NIH), 
opinions are divided about how the geneticist 
will steer the agency through its extraordinary 
funding boom. 

Following President Barack Obama's long- 
anticipated nomination of Collins on 8 July, 
Harold Varmus, NIH director from 1993 to 
1999, and now president and chief executive of 
the Memorial Sloan-Kettering Cancer Center 
in New York, described him as “a terrific scien- 
tist, inspirational leader, superb manager, and 
adept politician. I have great confidence in his 
ability to lead the NIH in complicated times.” 

But some believe that Collins has focused 
too much on genomics to the detriment of 
other approaches to improving health. Collins 
directed the NIH’s National Human Genome 
Research Institute (NHGRI) for 15 years until 
2008, led the public effort to sequence the 
human genome and wasa co-discoverer of the 
gene responsible for cystic fibrosis (see Nature 
460, 164-169; 2009). “The genetic approach is 
extremely important, but it is a tool, it is not the 
be all and end all,” says Fran Visco, president of 
the Washington-based National Breast Cancer 
Coalition, which has called on the NIH to back 
more research on the environmental causes of 
breast cancer. 

There are also concerns about whether 
Collins’s very public expressions of his evan- 
gelical Christian faith will affect his job. He laid 
out his beliefs in a 2006 book, The Language 
of God: A Scientist Presents Evidence for Belief, 


and last year founded the BioLogos Founda- 
tion, which aims to help Christians integrate 
their faith with contemporary science. 

Steven Pinker, a psychologist at Harvard 
University, says he has “serious misgivings” 
about the nomination. “Collins is an advocate 
of profoundly anti-science beliefs, and it is 
reasonable for the scientific community to ask 
him how these beliefs will affect his adminis- 
tration of the NIH and his efforts on behalf of 
the scientific enterprise.” 

Several scientists close to Collins say that his 
faith will not affect his NIH duties. “In all the 
years that I worked with Francis — and there 
were alot of years — I never once saw evidence 
where his religious faith was in conflict with 
his scientific judgement,’ says developmen- 
tal biologist Shirley Tilghman, president of 


Francis Collins has been 
a staunch advocate of 
genomic medicine. 


Princeton University in New Jersey, who sat 
on Collins's advisory council when he directed 
the NHGRI. 

“We would count him as an ally,’ adds Joshua 
Rosenau, a policy analyst at the National Center 
for Science Education, a non-profit organiza- 
tion in Oakland, California, that defends the 
teaching of evolution in schools. “It is helpful 
to have scientists like Francis Collins speak- 
ing out about how they personally reconcile 
science and religion.” 

The BioLogos Foundation has confirmed 
that Collins would step down from his role 
there before taking up the reins at the NIH. 
It’s the right move, says Varmus. “Discussion 
about the foundation and his involvement with 
it could readily become a distraction from the 
business of running the NIH,” he told Nature. 


P. FRANZ/AP 


Malaria drug-makers ignore WHO ban 


There is a growing risk that malaria 
parasites will develop resistance 

to artemisinin because almost 

half of both its manufacturers and 
malaria-affected countries are 
failing to comply with World Health 
Organization (WHO) demands 

to sell it only in combination with 
other drugs. Artemisinin and 

its derivatives are the leading 
treatments for the disease, being 
the only antimalarials that have not 
yet seen widespread resistance in 
malaria parasites. 
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The full scale of the problem is 
revealed in a soon-to-be-published 
WHO briefing seen by Nature, “Stop 
the marketing of oral artemisinin 
monotherapies”, which calls for 
governments to empower national 
drug-regulatory authorities to clamp 
down on offending companies. 

Treatments that use only 
artemisinin need to be taken for 
seven days to kill all parasites, 
but patients often stop treatment 
after a few days when they begin 
to feel much better. This leaves 


the remaining parasites in contact 
with low levels of the drug — a 
recipe for resistance. The WHO 
recommended in January 2006 that 
artemisinin should always be given 
in combination with other drugs 
for at least three days, becausea 
cocktail reduces the chances of 
resistance. The need to move away 
from monotherapies has become all 
the more urgent with recent reports 
of resistance arising in Cambodia. 
Although artemisinin-based 
combination therapies (ACTs) 
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have become the treatment of 
choice for malaria, with a three-day 
programme curing more than 95% 
of patients, monotherapies are 
cheaper to produce and sell. 

Of the 69 manufacturers of 
artemisinin monotherapies that 
the WHO has identified, 21 have 
withdrawn monotherapies, and 14 
say they intend to comply with the 
WHO's recommendations. But the 
remaining 34 have not yet disclosed 
their intentions. Many have not even 
replied to multiple WHO e-mail 


APPROVALS ON TRIAL 

European rules for clinical 
studies need streamlining. 
www.nature.com/news 


PUNCHSTOCK 


Collins takes over an NIH awash with cash, 
yet bracing for leaner times after a one-time 
bolus of US$10.4 billion in economic stimulus 
spending runs out in September 2010. Spending 
the money both rationally and quickly is already 
proving a daunting challenge for the NIH — the 
world’s largest funder of biomedical research 
— whose peer-review system is close to being 
overwhelmed by a flood of applications (see 
Nature 459, 763; 2009). “Everybody and their 
brother is going to have an idea of how to do 
this,’ says Tilghman. “He's going to have to sift 
through all those ideas and plans and ultimately 
do what is best for the United States. And he is 
going to have to do it relatively quickly” 

Since Obama's 2010 budget requested only a 
1.4% increase for the $30.3-billion agency, the 
NIH could also face the kind of boom and bust 
it confronted in 2003, when its rapidly expanded 
ranks of scientists were left competing for slices 
of a suddenly very finite pie. “He's going to have 
to fight for budget,” says Paul Nurse, the Nobel- 
prizewinning biologist who is president of the 
Rockefeller University in New York. 

The NIH is a behemoth that now has 27 insti- 
tutes and centres and about 18,000 employees 
— much larger than the more focused NHGRI. 
‘A big challenge for him is going to be to shift 
into a new role from his old role,” says Elias 
Zerhouni, who left the NIH director's job last 
October. “You can't just dictate what you think 
is right. You have to listen a lot more.’ 

White House rules prevent Collins from 
speaking to journalists before his nomination is 
considered by the Senate Committee on Health, 
Education, Labor, and Pensions, chaired by Sen- 
ator Edward Kennedy (Democrat, Massachu- 
setts). Kennedy has called the choice of Collins 
“inspired”, and said he would work to see that 
the nomination is approved “without delay”. = 
Meredith Wadman 


and fax requests for information, says Andrea 
Bosman, an official at the WHO's Global Malaria 
Programme (see http://tinyurl.com/m4gqmn). 

Regulatory authorities in just 39 of the 76 
countries using malaria drugs have so far complied 
or said they intend to comply with the WHO's 
recommendations (see http://tinyurl.com/ 
m94wgk). National regulatory authorities could 
help by banning monotherapies, says Bosman, but 
many are weak, understaffed and lack expertise. 

“It's terrible," says Bosman, adding that every 
country where malaria is endemic is affected by 
the problem. “Who says there is no profit to be 
made in malaria? When you see the number of 
companies operating in Africa, and the diversity 
of products, you'd just be amazed.” | 
Declan Butler 


Flu furore hits Argentina 


BUENOS AIRES 
Argentina's government has for more 

than two weeks ignored recommendations 
from a committee of influenza experts 

to declare a national health emergency. 
Such a declaration would prompt greater 
federal resources to combat the spread of 
the pandemic H1N1 2009 virus. 

The committee, specifically set up to 
advise the Ministry of Health on swine flu, 
issued its call just days before Argentina’s 
national election on 28 June, which it 
feared could accelerate the spread of 
the virus. 

Following the committee's call for action, 
the health minister, Graciela Ocafa, 
asked for the election to be postponed, 
but her request was rejected. Ocaiia 
subsequently resigned and left office 
on 29 June. Mirta Roses, 
director of the Pan American 
Health Organization — the 
Latin American and Caribbean 
arm of the World Health 
Organization (WHO) — has 
also criticized the decision to 
go ahead with the elections. 

“The delay was 
recommended because the 
agglomeration of people in 
places with little ventilation 
favours the transmission of 
the disease,” says Jorge San 
Juan, coordinator of the 
expert committee and head 


as an epidemiologist, signed a decree that 
gives health authorities in the country the 
power to cancel events that could spread 
swine flu. 

Meanwhile, a group of doctors, hospital 
workers and scientists called Médicos Sin 
Banderas (Doctors without Flags) has 
also accused the national government of 
deliberately hiding the full extent of the 
swine-flu outbreak. 

Ariel Umpierrez, a health economist 
who heads the group, says that its 
members have been sharing information 
about flu cases from their places of work 
across the country. Those data show that 
the government has lowered the official 
number of cases by excluding infected 
people who visit private clinics, he argues. 
Argentina’s Ministry of Health has not 
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Argentines have been travelling abroad to buy flu drugs. 


of the intensive-care unit at 

Mujhiz Hospital in Buenos Aires. The 
government now faces two lawsuits 
brought by individual lawyers who claim 
that its decision not to delay the elections 
recklessly endangered its citizens, who are 
legally required to vote. 

Epidemiologist Emilio Santabaya, 
former head of the Malbran Institute in 
Buenos Aires — the country’s main flu 
monitoring centre — adds that Argentina 
knew an epidemic was coming and did 
nothing, and that flu monitoring in the 
country is not adequate to track the 
spread of the virus. The situation contrasts 
with that of neighbouring country Chile, 
which has many more flu surveillance 
centres, according to Jorge Jimenez, 
who has worked both as Chile’s health 
minister and as chairman of the WHO’s 
executive committee. On 7 July, Chilean 
President Michelle Bachelet, who trained 
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responded to Nature’s queries on the 
matter. 

Although scientists inside the country 
say they are being ignored, those outside 
are worried by reports of panicked 
Argentines travelling to Chile and 
Uruguay to buy flu drugs. The drugs, 
which are being taken without medical 
advice, are probably being used ina 
way that promotes the development 
of resistance, says Eric Toner, a senior 
associate at the University of Pittsburgh’s 
Center for Biosecurity in Pennsylvania. 
Because Uruguay is heavily dependent 
on trade with Argentina, it is highly 
unlikely to prevent Argentines from 
entering the country — but the 
governments of Brazil and Bolivia are 
considering closing their borders to 
prevent the spread of infection. a 
Anna Petherick 
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NEWS 


Medical isotope shortage 
reaches crisis level 


Robust solutions sought urgently to shore up fragile supply chain. 


The worldwide shortage of medical isotopes is 
about to get much worse this week, as the High 
Flux Reactor in Petten, the Netherlands, closes 
for a month-long maintenance inspection. 

It joins the National Research Universal reac- 
tor in Chalk River, Ontario, Canada, which has 
been closed since 15 May because of a heavy- 
water leak and is unlikely to restart before late 
2009, according to Atomic Energy of Canada 
Limited, the government-sponsored body that 
runs the facility. 

Together, the reactors produce two-thirds of 
the global supply of molybdenum-99, which 
decays to form technetium-99m, an isotope 
that is used in about 70,000 medical imaging 
procedures worldwide every day. 

The shutdown has prompted calls for a major 
review of the way that medical isotopes are made 
and distributed. All five commercial nuclear 
reactors that use neutron-induced fission of 
highly enriched uranium-235 to make **Mo 
(see map) are more than 40 years old and cracks 
are beginning to emerge. The Petten reactor is 
scheduled to close again in early 2010 for up to 
six months so that deformed pipes in its cooling 
system can be repaired. 

With both reactors unavailable for signifi- 
cant periods, “you really do have to say: ‘right, 
let’s clear the table and start all over’’, says 
Thomas Ruth, senior research scientist at TRI- 
UME Canada’ national laboratory for particle 
and nuclear physics in Vancouver. 

The problem arises because isotopes cannot be 
stockpiled — Mo hasa half life of 66 hours and 


Uses highly enriched 
uranium-235 targets 


Uses low-enriched 
uranium-235 targets 
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"Tc of just six hours. ”Mo created in nuclear 
reactors must be shipped quickly to facilities 
that process the material into *’Tc generators. 
These generators are delivered to hospitals on 
a weekly basis, where they are ‘milked’ every 
morning to deliver ””’Tc. Although the reactors 
in Belgium and South Africa will be working at 
full power over the next few weeks to produce 
more ”Mo than usual, isotope suppliers say they 
will probably not be able to meet the shortfall. 
The shortage has been particularly acute in the 
United States and Canada, and “the situation 
will get much worse’, says Michael Graham, 
director of nuclear medicine at the University 
of Iowa in Iowa City, and president of the inter- 
national Society of Nuclear Medicine based in 
Reston, Virginia. “It is likely many studies will 
be cancelled or postponed” 


Leaky pipeline 
Week-by-week availability of Mo is now 
coordinated by the Brussels-based Association 
of Imaging Producers and Equipment Suppliers 
(AIPES), which in the past year has taken the 
initiative to stimulate global action on the iso- 
tope crisis. Both the AIPES and the Organisation 
for Economic Co-operation and Development 
(OECD), based in Paris, began to investigate 
long-term solutions to the chronic shortage of 
Mo at the start of 2009. But given the impor- 
tance of ”"Tc in health care, the ageing reactor 
infrastructure and highly fragmented supply 
network needs a root-and-branch overhaul. 
“We must upgrade and invest in updates to 
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these ageing reactors,’ says George Segall, chief 
of the nuclear-medicine service at the Veterans’ 
Affairs Palo Alto Health Care System in Cali- 
fornia. “Many of these reactors have already 
exceeded their useful lifetime. No one expected 
that they would still be needed now when they 
were built” 

The Canadian government has so far resisted 
calls to switch on two Multipurpose Applied 
Physics Lattice Experiment (MAPLE) reac- 
tors, built at the Chalk River site to replace the 
52-year-old National Research Universal reactor. 
The MAPLE reactors were to be the first nuclear 
reactors in the world dedicated to production 
of medical isotopes, but are lying dormant after 
Atomic Energy of Canada Limited halted devel- 
opment in May 2008, citing safety issues that it 
claimed it would be too expensive to resolve. If 
started up, these reactors alone could deliver 
more than the current global requirement for 
medical isotopes, according to Harold Smith, 
who was a manager on the MAPLE project. 

Some medical-isotope suppliers are looking 
to academic research reactors to fill the gap. 
MDS Nordion, a medical-isotope provider 
based in Ottawa, Canada, that had invested 
heavily in the ill-fated MAPLE project, is nego- 
tiating with the Karpov Institute of Physical 
Chemistry in Moscow, which holds a 90% share 
of the market for ”""Ic generators in Russia. 

Researchers at Germany’s Technical Univer- 
sity of Munich are seeking financial backing 
from the government to support an upgrade to 
the university's FRM II neutron source to create 
Mo. If highly enriched *”U targets were used, 
the reactor could satisfy almost all of Europe's 
needs for” Tc once it becomes fully opera- 
tional, says Winfried Petry, scientific director 
of the FRM II. 

An alternative option, says Ruth, could be 
to fire high-energy electrons at Mo targets, 
forcing out a neutron to create Mo. This 
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Australia's OPAL reactor is the 
only one currently irradiating low- 
enriched uranium. 


process could be carried out in small particle 
accelerators, similar to those already used to 
make medical isotopes such as fluorine-18 for 
positron emission tomography scans. Although 
this strategy would be much less efficient than 
351] fission, it could at least alleviate °’"Tc 
shortages at a local level. 

Another possible pathway is electron-induced 
38U) fission. This, too, would bypass the need for 
nuclear reactors, although it would need to be 
done in dedicated high-powered accelerators 
(T. Ruth Nature 457, 536-537; 2009). 

For now, 7°U fission remains the most 
efficient way to generate Mo in the quantities 
needed by the medical community. However, 
the reliance on highly enriched **U has made 
many governments reluctant to build more 
reactors because of fears that the material could 
be used to make a nuclear weapon. 

Australias OPAL reactor, located in New 
South Wales, is the only site in the world that 
is irradiating low-enriched **U targets, which 
pose less of a proliferation threat and also 
produce less hazardous waste. However, the 
University of Missouri in Columbia is seeking 
funding to upgrade its research reactor, which 
proponents say could start Mo production 
from low-enriched uranium by the end of 
2011. Meanwhile, the reactor-construction firm 
Babcock & Wilcox in Lynchburg, Virginia, in 
partnership with the medical-isotope supplier 
Covidien of Mansfield, Massachusetts, wants 
to build a series of small nuclear reactors with 
a novel liquid core that also relies on low- 
enriched uranium. 

Ultimately, governments now need to secure 
the long-term supply of medical isotopes, says 
Graham. “There needs to be a major effort in 
several countries to either convert existing 
reactors or build new reactors to ensure a stable 
long-term supply of °Mo”” a 
Paula Gould 
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HAVE YOUR SAY 
Comment on any of our 
News stories, online. 
www.nature.com/news 


G8 climate target questioned 


The path to a meaningful 
deal at the Copenhagen 
climate summit in 
December seems more 
treacherous in the wake 
of last week’s meeting of 
the G8 nations. 

Leaders gatheredinthe |COPENHAGEN 
quake-struck Italian town 
of L Aquila promised to try to prevent 
global temperatures from rising more than 
2°C above pre-industrial temperatures. 
They also said that rich countries would 
cut their aggregated greenhouse-gas 
emissions by 80% by 2050, and, with other 
countries, would cut global emissions by 
50% over the same period. But they made 
no commitment to reducing emissions 
any earlier and did not agree on a base 
year against which to measure national 
emissions cuts. 

The choice of base year “matters a great 
deal’, says Gwyn Prins, an economist at 
the London School of Economics. A recent 
year, say 2005, would make 
targets easier to achieve for 
countries such as Canada, 
Japan and the United 
States, whose economies 
and emissions have grown 
constantly over the past 20 years. It would 
be even more attractive for booming 
countries such as China and India. But 
the European Union would benefit from 
a 1990 baseline — as emissions there have 
fallen since then — as would Russia, which 
saw a dramatic industrial decline in the 
1990s. 

Others argue that the 2°C limit is, in fact, 
a crucial commitment because it marks 
the first time G8 countries have agreed 
on what constitutes ‘dangerous’ climate 
change. Two degrees warming translates 
with “mathematical clarity” into the 
required “decarbonization” of the global 
economy, says Hans Joachim Schellnhuber, 
director of the Potsdam Institute for 
Climate Impact Research in Germany. 
A study earlier this year, for example, 
suggested that there is a 50% chance that 
global warming will exceed 2°C by the 
end of the century if around 1,400 billion 
tonnes of carbon dioxide is emitted during 
the period 2000-50 (M. Meinshausen et al. 
Nature 458, 1158-1162; 2009). Emissions 
for 2000-06 alone amounted to around 
234 billion tonnes. 

“We have now a politically accepted and 
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“So much for science 
— the restis upto 
politicians and voters.” 


science-based threshold that allows us to 
calculate precisely how much greenhouse 
gas we can still afford to emit if we 

don’t want to exceed a given probability 

of getting into dangerous territory,” 
Schellnhuber says. “So much for science — 
the rest is up to politicians and voters.” 


Packed schedule 

However, much remains to be done before 
the Copenhagen meeting in December, 
when nations will meet to negotiate 

a successor treaty to the 1997 Kyoto 
Protocol on climate change. 

Last week, China and India reiterated 
their unwillingness to sign up to binding 
emissions cuts. If that stance does not 
alter, it is likely to jeopardize a bill to 
introduce a cap-and-trade programme to 
cut US emissions by 83% from 2005 levels 
by 2050. 

The US House of Representatives 
last month passed its version of the bill. 
But Senator James Inhofe (Republican, 
Oklahoma), a senior 
member of the panel 
drafting the Senate’s version 
of the bill, warned last week 
that “unless supporters of 
cap-and-trade legislation 
can develop a plan to persuade China 
and India to make meaningful emissions 
reductions on par with the United States, 
no such bill will pass the US Senate”. 

Serious questions also remain over 
whether the 80% reduction goal agreed 
by the G8 nations can be achieved using 
‘indirect policy tools such as emissions 
trading and carbon ‘offsetting’ by land- 
use change and forestry projects. The 
European Union's emissions trading 
scheme, for example, has failed to deliver 
substantial emissions reductions since its 
launch in 2005. 

Prins doubts that such policies, 
including cap and trade, will work (see 
G. Prins and S. Rayner Nature 449, 973- 
975; 2007). “We don't need meaningless 
targets, but real action on carbon intensity 
and energy efficiency,’ he says. “But this 
would require leaders admitting there’s 
something fundamentally wrong. ’m 
afraid this won't happen unless the facts 
eventually slap us in the face.” a 
Quirin Schiermeier 


See Editorial, page 307, and www.nature.com/ 
roadtocopenhagen for more climate coverage. 
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Shooting for the Moon 


The Apollo programme inspired thousands of people to pursue careers 
in science. Today, they still support human spacefaring — but baulk at 
the price. Richard Monastersky reports on the results of a Nature poll. 


In the heady days after the Apollo 11 lunar 
landing on 20 July 1969, countless children 
travelled to the Moon by cupping their hands 
over their mouths and announcing: “Hou- 
ston, Tranquility Base, here. The Eagle has 
landed.” With giant strides, they bounded 
across the landscape just like their heroes, 
Neil Armstrong and Buzz Aldrin. They 
practised lift-off by counting backwards 
to zero and blasting themselves skywards. 
And many of them landed, decades later, in 
scientific careers. 

In fact, the Moon landings deserve credit for 
motivating a large fraction of today’s scientists, 
according to a survey of almost 800 research- 
ers who have published in Nature in the past 
three years’. 

Half of the researchers who responded to 
an online poll last month said that the Apollo 
missions had inspired them to pursue science 
— and not just in astronomy or planetary sci- 
ence (see ‘Survey respondents’). “I became 
completely space crazy,” said one life scientist. 
“I was certain I'd be an astronaut. My interest 
shifted to biology, but I still believe Apollo 11 
was a major influence on me?” 

The survey, intended to capture broad 
impressions rather than to precisely measure 
attitudes, also reveals surprisingly widespread 
support among scientists for human space 
exploration. Nearly 80% of scientists polled 
said that there are scientific research justifica- 
tions for continuing human spaceflight. 

More than 80% felt that the life sciences, 
physical sciences, engineering and human 
physiology all benefited to some degree from 
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Life scientist 


human spaceflight, and almost 90% said that it 
still inspires younger generations to study sci- 
ence (see ‘Survey results’). Their impressions 
are not without justification. The US National 
Research Council concluded in 2003 that 
work in microgravity has had a major effect 
on several areas of physical sciences, including 
research into fluids, combustion and crystal 
growth’. The Neurolab experiments, which 
flew on NASAs space shuttle Columbia in 1998, 
are regarded as the high point of life-sciences 
research in space and resulted in 100 publica- 
tions, according to Joan Vernikos, former direc- 
tor of life sciences at NASA. 


Up, up and away 

In principle, the scientists who participated 
in the poll seemed to support manned space 
programmes, with 62% of Americans and 
83% of Europeans rating their own country’s 
expenditures on human spaceflight as “about 
right” or “not enough”. The stronger support 
in Europe perhaps reflects a vast difference 
in spending on either side of the Atlantic. 
Including money provided by the stimulus 
package, NASA will pour US$9.7 billion into 
activities related to human spaceflight in the 
current fiscal year, roughly half of the agency’s 
$18.7-billion budget. The European Space 
Agency (ESA) allocates only €481 million 
($670 million), about 13% of its €3.6-billion 
($5.0 billion) budget. 

But support for sending people to space 
flagged when participants were asked to pri- 
oritize spending. Nearly 80% would welcome 
a shift in spending from human spaceflight to 
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Canada 


Rest of World 
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Humans have not walked 
onthe.Moon since the 
Apollo 17 mission of 1972. 


unmanned space missions, whereas 70% would 
prefer some of that cash to be spent on other 
areas of scientific research. 

David Southwood, director of science and 
robotic missions for ESA, says that the affec- 
tion for shifting money rests on a false premise, 
because money taken from human spaceflight 
would not go to unmanned projects. Indeed, 
written responses from several of those who 
completed the survey revealed the opposite 
sentiment: some supported human spaceflight 
thinking it would increase spending generally 
on space research. 

Southwood says that the positive attitudes 
seen in the Nature poll reflect the broad impact 
of human spaceflight, beyond simply yielding 
data for science. “It gives humans a vicarious 
means of breaking the surly bonds of gravity,” 
he says, borrowing a phrase from the pilot- 
poet John Gillespie Magee. 

During the lead-up to the Apollo missions, 
scientists had similar outlooks to those seen 
in the current survey. A study conducted by 
the American Association for the Advance- 
ment of Science in 1964 found that scien- 
tists generally supported the goal of landing 
humans on the Moon, but they objected to 
the tight timetable and the extreme costs’. At 
the time, the NASA budget consumed 4.3% 
of the total US budget; it now accounts for 


just 0.5%. Scientists in the public eye have 
generally been more sceptical about human 
spaceflight than those in Nature’ poll, says 
John Logsdon, an aerospace historian at the 
Smithsonian Institution in Washington DC. 
“There have been very few scientists publicly 
giving this kind of support for human space- 
flight,” he says. 

With some scientists, the support for space- 
flight was quite personal (see ‘Moondreams’). 
About 5% have applied to be an astronaut and 
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14% said they would pay between $10,000 and 
$50,000 for a brief sub-orbital trip into space. 

It remains to be seen whether NASA's 
space programme will continue to earn their 
enthusiasm. Next month, a panel convened by 
US President Barack Obama will issue its 
assessment of America’s plans to return to the 
Moon by 2020 (ref. 4), and its approval of the 
programme is far from certain. 

Yet more than 36 years after man last ven- 
tured beyond the confines of low-Earth orbit 
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benefited from human spaceflight 
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Moondreams 


“I was a small boy living in provincial New 
Zealand when the Apollo missions were 
going on, and the astronauts and scientists 
behind that remarkable series of missions 
were the one group of people who genuinely 
inspired me.” Earth and planetary scientist, 
Oceania 


“There were no women involved that | 
could see, so | wanted to be the first female 
astronaut, or at least show them that 
women could be scientists too.” 

Life scientist, US 


“We can do everything we need to do using 
robotics much cheaper.” Earth and planetary 
scientist, Europe 


“Putting men on the Moon meant that with 
science, nearly anything is possible.” 
Life scientist, US 


“Without human spaceflight, we will only 
be able to scratch the surface of space 
exploration.” Engineer, US 


“| knew nerds were heroes at NASA. That 
was inspiring.” Physical scientist, US 


during the Apollo 17 mission, Nature’ survey 
would suggest that many will support the next 
grand voyage to the Moon and beyond. a 
Survey work was aided by Sara Grimme. 


1. www.nature.com/nature/newspdf/apollo_results.pdf 

2. Assessment of Directions in Microgravity and Physical Sciences 
Research at NASA (National Academies Press, 2003). 

3. Abelson, P. H. Science 145, 539 (1964). 

4. Hand, E. Nature 459, 1038-1039 (2009). 


See Editorial, page 307, and Nature's Apollo special 
at www.nature.com/apollo. 
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Arms pact could boost 
US-Russian science 


Nuclear scientists in Russia and the 
United States say that the sputtering 
relationship between their weapons 
laboratories could be rekindled by an 
agreement to cut the nations’ nuclear- 
weapons arsenals. 

US President Barack Obama and 
Russian President Dmitry Medvedev 
announced on 6 July that they intended 
to reduce their stockpile of warheads. 
The pair also discussed scientific 
collaboration as a way to strengthen 
ties between the two nations. 

Even during the cold war, US and 
Soviet scientists worked together on 
specific issues, such as the development 
of fusion energy. Wider programmes 
began in earnest after the fall of the 
Soviet Union, when the US government 
stepped in with millions of dollars in 


Presidents Dmitry Medvedev and Barack Obama want to 
cut back their nations’ nuclear-weapons stockpiles. 


funding for scientific collaborations. 

That cooperation remained strong until the 
new millennium, but “has dropped off sharply 
during the past eight years or so’; says Siegfried 
Hecker, a metallurgist at Stanford University 
in California, and former director of the Los 
Alamos National Laboratory in New Mexico, 
one of the two main US nuclear-weapons labs. 

In 2000, Vladimir Putin, Russia’s newly 
elected president, began tightening up secu- 
rity and scaling back access to the nation’s 
military laboratories. Meanwhile, politicians 
in the United States were growing increas- 
ingly sceptical of programmes that directed 
tax dollars into the Russian defence establish- 
ment, says David Holloway, a political scien- 
tist at Stanford University who has studied the 
Soviet weapons complex. 

Programmes to improve the physical secu- 
rity and accounting of nuclear materials have 
survived the cooling political climate, but 
Hecker says that “most of the lab-to-lab scien- 
tific exchanges are gone”. 

Obamas promise to “reset” the US-Russian 
relationship has sparked hopes that these 
exchanges can be renewed. In March, Hecker 
met with other US and Russian scientists in 
Moscow to discuss how to renew strategic and 
scientific cooperation. 

The collaboration is likely to be more 
even-handed than it was in the 1990s because 
conditions at Russian laboratories have 
improved so dramatically, according to Adriaan 
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van der Meer, the executive director of the 
International Science and Technology Center 
in Moscow, which was set up in 1992 to foster 
collaboration between the former Soviet states 
and the West. 

“We need to move to a true partnership,” 
agrees Hecker. Among the various projects 
Hecker would like to see supported are efforts 
to develop ‘safer’ warheads that are more 
resistant to tampering and accidental deto- 
nation, improved detectors for combating 
nuclear smuggling and databases for tracing 
the origin of bomb material. There is also great 
potential for cooperation on civilian nuclear- 
power research, he says. “We have superb 
modelling and the Russians have excellent 
experimental facilities” 

Holloway, who attended the March meeting in 
Moscow, says that scientists from both countries 
are clearly eager to embark on a new cooperative 
relationship. But he notes that much more dip- 
lomatic work must be done before researchers 
can start purchasing plane tickets. 

Russia is currently unhappy with US inten- 
tions to place a missile defence system in 
Europe, for example, while the United States 
has halted a plan for civilian nuclear coopera- 
tion after Russian forces invaded the Republic 
of Georgia last year. Until the two sides begin to 
tackle these larger issues, says Holloway, scien- 
tific collaboration may be slow to blossom. @ 
Geoff Brumfiel 
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Europe attacks tactics that 
delay generic drugs 


The European Commission last week 
promised tougher scrutiny of drug industry 
deals that maximize profits on branded 
medicines by slowing consumers’ access to 
cheap generic drugs. 

It opened an investigation into Servier, a 
pharmaceutical company headquartered in 
Neuilly-sur-Seine, France, and several other 
drug manufacturers, over agreements that 
may have breached anti-trust laws. 

Neelie Kroes, the European competition 
commissioner, also announced the 
final report of her department's 
18-month inquiry into competition in 
the pharmaceutical sector (see http:// 
tinyurl.com/EUreport). The report says 
that ‘company practices’ are delaying the 
market entry of generic drugs, but notes 
that shortcomings in Europe’s regulatory 
framework might be a factor too. 

An industry trade body, the European 
Federation of Pharmaceutical Industries 
and Associations, approved of the reference 
to regulatory obstacles. It says that the 
findings show a “welcome shift away from 
the emotive language” of an interim report 
released last November. 


Obama names next head of 
US Geological Survey 


A California-based geophysicist is set to 
become the first woman to direct the US 
Geological Survey (USGS) in its 130-year 
history. 

Marcia McNutt, now president and chief 
executive of the Monterey Bay Aquarium 
Research Institute (MBARI) in Moss 
Landing, expects her Senate confirmation 
hearing by September, following her 
nomination last week by President Barack 
Obama. The USGS has an annual budget of 
$1 billion and 8,800 members of staff. 

McNutt, an avid horsewoman who has 
run the MBARI since 1997, would also 
become the science adviser to the secretary 
of the interior. 


es 


Marcia McNutt is set to take the reins at the USGS. 


Mars Science Laboratory devours budgets 


A further budget overrun 

for NASA's Mars Science 
Laboratory rover (pictured), 
due to launch in 2011, could 
for the first time delay other 
missions in the agency's cash- 
strapped planetary-science 
division. 

In areport due to be handed 
to Congress by the end of 
July, NASA will announce that 
the mission needs between 
$15 million and $115 million 
more than its estimated price 


NASA/JPL-CALTECH 


tag — which, at $2.28 billion, is already 40% above an official $1.63-billion estimate made in 2006. 
The agency has so far raided technology-development funds within the Mars programme, but 

if additional costs rise too much, it may have to delay two planned Moon missions. “The time for 

some tough decisions is here,” said NASA science chief Ed Weiler, breaking the news to planetary 

scientists at an advisory-committee meeting on 9 July at NASA headquarters in Washington DC. 

Weiler also confirmed, as expected, that NASA would work in partnership with the European 


Space Agency on all future major Mars missions. 


For a longer version of this story, see http://tinyurl.com/marscash 


EI Nifio arrives in the 
Pacific Ocean 


US climate scientists last week announced 
the arrival of El Nijio, a cyclical rise in sea- 
surface temperatures in the central and 
eastern Pacific Ocean. The event is known 
to influence fisheries and global weather 
patterns. 

El Nifios, which are associated with 
a weakening in the easterly trade wind, 
occur every two to five years and typically 
last for about a year. The current one is 
expected to continue developing over the 
next several months and to last through the 
winter, according to the National Oceanic 
and Atmospheric Administration in 
Washington DC. 

Previous El Nifios have seen more rainfall 
over the central tropical Pacific, drought in 
Indonesia and powerful winter storms in 
California as well as flooding and mudslides 
in Central and South America. The 
phenomenon has also been linked to less 
hurricane activity in the Atlantic Ocean and 
additional winter precipitation in the arid 
southwestern United States. 


German molecular 
biologist to head EMBO 


Maria Leptin was last week appointed 
the new head of the European Molecular 
Biology Organization (EMBO). She is only 
the fifth director in the 45-year history of 
the organization, which has more than 1,300 
members. 

Leptin, who works on cell shapes and 
immunity at the University of Cologne, 
Germany, says that she hopes EMBO will 
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improve connections with India, China and 
the United States, and that it can influence 
governments on policy matters such as 
stem-cell research. 

She will move to EMBO’s base at the 
European Molecular Biology Laboratory in 
Heidelberg, Germany, in January next year. 
For an interview with Maria Leptin, 
see http://tinyurl.com/Leptininterview 


US renewable energy 
gets cash boost 


The Obama administration announced 
plans on 9 July to pump billions of dollars 
directly into the US renewable-energy 
market, providing a new avenue for 
funding that had dried up during the 
banking crisis. 

The programme allows developers to 
receive cash payments, in lieu of existing 
tax subsidies, for wind, solar and other 
renewable-energy projects. Until now, 
such payments had tended to flow through 
banks, which provided upfront financing in 
exchange for tax credits. The new approach, 
sought by the renewable-energy industry 
and backed by Congress in its February 
stimulus bill, allows direct access to the 
subsidies. 

The Department of Energy plans to 
begin accepting applications in August, 
and estimates that demand could exceed 
$3 billion, enabling some $10 billion to $14 
billion in projects. 


Correction 

The News Feature ‘One gene, twenty years’ (Nature 
460, 164-169; 2009) incorrectly stated Robert 
Beall's current title. He is, in fact, president and 
chief executive of the Cystic Fibrosis Foundation. 
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Cellular life is all slopes, arcs and circles.— but there is much 


debate about how these curves are built? 


o view the innards of a cell is to view 

architecture reminiscent of Antoni 

Gaudi: the gentle arc of the cell mem- 

brane, the contortions of internal 
tubing, the tight bubbles of vesicles. But for 
biologists, this architecture is an intellectual 
puzzle as well as a beautiful structure. Mem- 
branes generally prefer to be flat — so what, 
exactly, is generating all the curves? 

Pietro De Camilli thought he had found one 
answer when, just over a decade ago, he saw a 
tangle of tubules in an electron micrograph. 
The pictures came from an experiment carried 
out by Kohji Takei, De Camilli’s postdoc at Yale 
University in New Haven, Connecticut. Takei 
had mixed a protein called amphiphysin, which 
is thickly clustered at the tips of neurons, with 
large bubbles of artificial membranes to try to 
replay the process by which neurotransmitter 
chemicals are packaged up. It came as a surprise 
when the membranes snapped into masses of 
jumbled, twisted tubes’. “It looked like a plate 
of spaghetti, it was absolutely spectacular,’ says 
De Camilli. “It was the first example of a pro- 
tein that, by itself, had dramatic membrane- 
deforming properties,” he says. 

The first, maybe — but not the last. Since 
this discovery, De Camilli and others have 
identified whole families of protein that have 
equally dramatic abilities to bend membranes, 
and a tightly knit community of researchers 
has built up to study them. Much debate has 
centred on whether the proteins create curves 


318 


by wedging themselves into membranes, or by 
moulding them into shape using curved pro- 
tein scaffolds. As it turns out, both may be at 
play — and more. 

‘The issue is not just one of beautiful archi- 
tecture: the ability to turn membranes into 
circles and tubes is central to almost every cel- 
lular process. Spherical vesicles are essential for 
carrying construction materials and communi- 
cation signals around the cell. And the undulat- 
ing network of membranes that 
makes up the mitochondria, 
endoplasmic reticulum (ER) 
and Golgi complex are vital to 
these organelles’ roles in energy 
production, protein synthesis 
and protein processing, respec- 
tively. Some researchers are now 
finding that, once built, the cell 
uses its curves to position cellular processes. 
“The fact that the shape can also be informa- 
tion is very exciting,” says Bruno Antonny, a bio- 
chemist at the CNRS Institute of Molecular and 
Cellular Pharmacology in Valbonne, France. 
“That you can sense curvature means that you 
can organize reactions in time and space.’ 

Walling off spaces by wrapping membranes 
around them was also necessary for the evolu- 
tion of cellular and organismal complexity, as it 
allowed cells to adopt specialized functions and 
trafficking to occur between compartments — 
the hallmarks of eukaryotic cells, says Harvey 
McMahon of the Medical Research Council 
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endall Powell reports. 


Laboratory of Molecular Biology in Cambridge, 
UK. “Evolution from the primordial single- 
celled organisms to multicellular organisms 
was made possible by the appearance of small 
membrane-bound spaces in the cell? he says. 


A matter of scale 
The closest that most cells come to having 
straight lines are the surfaces of the encircling 
plasma membrane. On tightly packed and col- 
umn-shaped cells, these are vir- 
tually flat; and even on curved 
cells, a particular spot on the 
circumference still seems level 
to the molecules there, much 
like the surface of Earth does to 
a person standing on it. Inside 
the cell, however, microscopy 
has revealed a landscape with 
much sharper contours. Yet for many years 
researchers gave little thought to how such 
curves were generated: cell membranes were 
thought of as fluid, pliable structures that could 
easily be pushed or pulled into shape. 

That view changed in the early 1970s when 
physicist Wolfgang Helfrich, one of those behind 
the invention of the liquid-crystal display, pro- 
posed a model showing that membranes are 
relatively rigid structures that require substan- 
tial energy to contort’, just as a liquid crystal 
does. Cell membranes are made up of lipid 
bilayers: two tightly packed rows of lipid mol- 
ecules lined up with their hydrophilic lipid head 
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groups facing outwards, and water-repellent 
fatty-acid chains sandwiched on the inside. 
Work by Helfrich and other biophysicists has 
since shown that lipids prefer to be lined up in 
planes, and it takes a lot of energy to create the 
kind of disturbance that comes with bending a 
plane into a cylinder or sphere. The situation is 
similar to blowing a soap bubble from a circular 
wand — it takes continuous blowing to deform 
the flat soap film into a sphere. 

Researchers spent many years looking for 
scaffolding proteins that could sustain a mem- 
brane curve, but none emerged that could, 
according to biophysicists’ calculations, pro- 
vide sufficient energy. De Camilli says that it 
wasn't until the 1999 amphiphysin experiment 
that biologists began to appreciate that proteins 
could be ‘massaging’ membranes into shape 
rather than forcing them. What made that dis- 
covery all the more intriguing to De Camilli 
was that amphiphysin is normally found 
floating in the cell cytosol, not integrated into 
membranes. He wondered whether there were 
more proteins that could distort membranes so 
strongly, and how they did it. 

By 2002, De Camilli’s group and others had 
identified a few more proteins that had mem- 
brane-flexing abilities’ ° — and saw that they 
shared an intriguing spiral structure called an 
amphipathic helix. Looking down the barrel of 
the helix, one halfis charged and the other half 
uncharged. Put the protein near a bilayer and 
it will immediately insert itselflengthwise with 
its charged half among the lipid head groups 
and the uncharged half nestled within the 
fatty-acid chains. This suggested that inser- 
tion into the outer half of the bilayer would 
create a wedge, and if enough wedges were 
inserted then the membrane would bend 
(see graphic, overleaf). It was an instantly 
appealing idea and one that De Camilli pro- 
moted — but it was about to be joined by an 
equally appealing rival. 

In 2004, a group led by McMahon 


and his colleague Phil Evans solved the 
three-dimensional structure of the ‘BAR’ 
domain of amphiphysin, which includes the 
amphipathic helix®. The helix didn’t crystallize 
well, but the rest of the BAR domain did — and 
their work showed that it was crescent shaped. 
“We could immediately see this banana shape, 
that, to me, is one of the most beautiful struc- 
tures weve solved? McMahon says. Beauty aside, 
the banana suggested an alternative way for the 
BAR domain to bend membranes. The concave 
face of the banana is positively charged, which 
would make it stick to the negatively charged 
outer surface of membranes and mould them to 
their own curved shape. 

The bananas did not go 
down well at meetings, and 
McMahon recalls more than 
a few scoffs and scowls. The 
idea that they were acting as 
scaffolding seemed too sim- 
ilar to previous scaffolding 
models, which the physi- 
cists had already proved 
wrong. Clearly BAR domains were important: 
evidence was piling up that they were a huge 
family and the various relatives could subject 
membranes to all sorts of bending. But were 
the banana-shaped molecules generating the 
curvature or simply stabilizing or sensing the 
curvature after it had been produced by inser- 
tion of the amphipathic helix? 

The question has inspired vigorous debate 
and some flip-flopping between ideas. 
Although De Camilli says he was originally scep- 
tical of the scaffolding concept, his work now 


Oodles of tubules: 
amphiphysin triggers cell 
membranes to assemble 
into tubes. 
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“We could immediately 
see this banana shape, 
that, to me, is one of the 


most beautiful structures 
we've solved.” 
— Harvey McMahon 


promotes it. Last year, his group, in collaboration 
with Yale colleague Vinzenz Unger, published 
stunning cryo-electron microscopy images of 
BAR domains without amphipathic helices. The 
domains stacked up end to end to form a spiral 
around membrane cylinders much like a bar- 
ber’s pole’, suggesting that scaffolding alone can 
generate curvature — at least at the artificially 
high concentrations present in the test tube. 


Rods and curves 

McMahon now raises an eyebrow at De Camilli’s 
latest model for how thousands of banana 
domains could generate curvature alone. Work 
from his lab has shown that 
this domain is much less 
efficient at forming mem- 
brane tubes in the lab dish 
if the amphipathic helix has 
been lopped off. “This was 
the first demonstration that 
the two functions might be 
joined in one protein,” says 
McMahon. 

Recently, McMahon has started to promote the 
wedging idea. He teamed up with biophysicist 
Michael Kozlov at Tel Aviv University in Israel, 
who calculated that insertion of an amphipathic 
helix alone is energetically sufficient to gener- 
ate curves, whereas scaffolding alone is not. 
The study showed that the most efficient way to 
bend a membrane is to insert a short rod shape, 
similar to the amphipathic helix, into the outer 
layer of the membrane at a depth — and this is 
key — of around one-third of the membrane’s 
thickness*. Their calculations predicted that 
the bending ability of BAR domains can be 

credited solely to the insertion of the 
amphipathic helix. “Now the feeling is 
that this banana-like shape stabilizes 
the curvature that is generated by some 
sort of insertion into the membrane,’ 
says Kozlov. Speaking of McMahon, 
De Camilli says, “Both of us got it par- 
tially right and partially wrong, so it has 
been a humbling experience?’ 

Just as researchers were finding 
something to agree on about BAR 
domains, a whole new hypothesis 
for generating curves burst onto the 
scene. This one came from the lab of 
Tom Rapoport at Harvard Medical School 
in Boston, Massachusetts. In the late 1990s, 
his graduate student Lars Dreier was mixing 
nuclear-envelope membranes with DNA, 
attempting to imitate in a test tube the way that 
the envelope reforms after cell division. In the 
background of the microscope slide, Dreier 
noticed a network of membrane tubules form- 
ing that closely resembled the ER, the main 
site of protein synthesis in the cell. When 
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Rapoport saw it, he says, “I ran around the lab degree of curvature tells the enzyme that there’s 
asking everyone, ‘Did you see this? It’s fantas- BENDING: THE RULES a vesicle that needs uncoating, Antonny says, 
tic! and getting everyone to look through the Two possible ways to bend a membrane: so that it binds only to vesicles and not to other 


insertion of wedges, such as amphipathic helices, 
or moulding around a scaffold. 


curved surfaces. 

By chopping ArfGAP1 into smaller bits, the 
group found that its key curve-sensing stretch 
was an amphipathic helix. “But the chemistry 
of the helix was almost the opposite of what 

you would expect,” he says. Unlike that in 
the BAR domain, the helix in ArfGAP1 has 
almost no positive charge on one side and 
lacks the strong attraction to the membrane”. 
Antonny compares the protein’s behaviour to 


microscope”. The experiment seemed to have 
triggered membranes into self-assembling an 
ER, which is normally continuous with the 
nuclear envelope membrane’. 

The excitement only intensified when Gia 
Voeltz, then one of Rapoport’s postdocs, iso- 
lated a class of proteins responsible for curving 
the tubules in the ER”. Called reticulons, these 
proteins have neither a BAR domain nor an 
amphipathic helix and there is much discus- 
sion as to how they work. They do form a dou- that of a nervous swimmer about to take the 
ble hairpin-loop structure that inserts partway plunge. “It’s as if the molecule is shy and the 
into the membrane, and Rapoport and Voeltz, i O45 5565566406555 $5,596 (959165569059 6610 5910409904 i hhuttte water is not warm enough when the membrane 
along with De Camilli and Kozlov, think that LLL ALLL LAL Li )> is flat. But when you bend the membrane, the 
these hairpins may act as a wedge. (Kozlov has molecule can sense the lipid packing defect 
calculated that the hairpins probably insert to and inserts.” The molecule’s shyness is what 
the magical one-third depth.) makes it a good sensor, he explains. It dives in 
only when the amount of curvature is right and 
gets out when that curvature changes. 

Last year, Antonny’s lab found a similar cur- 
vature-sensing helix that may help to maintain 
the shape of the Golgi complex”, an organelle 
responsible for processing and trafficking 
proteins and other large molecules in the cell. 

an organelle’s tubular shape than a protein The Golgi is a series of flattened membrane 
such as amphiphysin, which inserts tempo- sacs stacked up like pancakes, with transport 
rarily as a budding vesicle forms. She origi- vesicles constantly budding off from its fringes. 
nally thought that the reticulons might stack How it maintains its architecture in the midst 
up to make a “nice little barber pole” scaffold _ with the idea that cells have evolved multiple _ ofall this trafficking has been a puzzle. The new 
around the ER tubes. But now, she’s pretty wedges, with different shapes and bending _ helix “isa start to explaining why the Golgi has 
sure it’s not that simple — and when she pulls __abilities, to fit different purposes. this beautiful architecture’, says Antonny. 
up her lab’s latest cryo-electron microscopy McMahon now says it was clear early on, “that Explaining beautiful — but functional — 
= images of the ER tubes, it’s easy to see why. _ we were going to have a whole repertoire of pro- architecture is after all what the whole field is 
3 In these images, which show the ER's three- _ teins — some driving curvature, some limiting about. “They strike us, they surprise us, but we 
: ~ dimensional structure at nanometre-scale, the _ or stabilizing curvature, and some sensing cur- __ finda remarkably appealing harmony in such 
= tubules look nothing like neat cylinders and __vature”. And Antonny’s group has been focused __ buildings,” says De Camilli of Gaudi’s con- 
2 more like gnarly tree roots with wide and nar- __ on the sensing part of the repertoire. He istry- structions. And cells, he says, are the same. “T 
8 row stretches. Voeltz now thinks that a clus- ing to understand how thebendofamembrane __ think the beauty of great architecture, like the 
& ter of several reticulon molecules may help to —_ canactasa signal, such that its position, camber _ beauty of cellular structures, resonates in us 
5 stabilize the tubes by forming a half-ring at __ or direction can recruit additional proteins toa _ precisely because they build on natural physi- 
= the narrowest points of constriction. This fits _ particular point on an organelle. cal principles.” a 
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The barber pole effect 

Because reticulons are sunk permanently 
into the membrane, Voeltz, who is now at the 
University of Colorado, Boulder, says that 
they and proteins like them might be better 
suited for the long-term maintenance of 
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BAR domains conjure tubules out of 
cell membranes. 
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Africa's Lake Kivu contains 
vast quantities of gas, which» 
makes it both dangerous 
and valuable. Anjali Nayar 
asks whether it is possible 
to tap the gas without 
causing a disaster. 


n late 2001, Mount Nyiragongo in the 
Democratic Republic of the Congo (DRC) 
was growing restless. Plumes of smoke issued 
from the central crater, alarming volcan- 
ologists in the nearby city of Goma. Then, on 
17 January 2002, lava fountained from a fracture 
on Nyiragongo’ southern flank. The molten rock 
snaked down the sides of the volcano and razed 
the centre of Goma, engulfing houses and set- 
ting offa string of explosions at fuel stations and 
power plants. That evening, the lava streamed 
into nearby Lake Kivu, generating a plume of 
water vapour that clouded the area for days’. 
More than 100 people were killed and nearly 
300,000 people fled their homes. The only 
obvious refuge for the displaced people was 
along the shores of the lake. But Kivu poses 
its own threats. Beneath its placid surface, the 
lake contains 300 cubic kilometres of carbon 
dioxide and 60 cubic kilometres of methane. 
A disruption to the lake — such as a bigger, 
closer eruption — could cause a gas burst, 
with potentially deadly consequences for the 
roughly 2 million people who live along Kivu’s 
shores. The risks are hard to quantify, however. 
Although scientists have studied the lake for 
decades, basic details about Kivu and its gases 
are still relatively scarce, and there is now 
debate about how hazardous the situation is. 
The issue is complicated by the lake's economic 
potential. The valuable methane dissolved in 
the water has started a feeding frenzy among 
energy companies working with the DRC and 
Rwanda, the other nation bordering the lake. 


In deals worth hundreds of millions of dollars, 
companies have started to siphon off the 
methane, in some cases working with the very 
scientists who have been assessing the lake's 
hazards. Proponents say that those degassing 
efforts will reduce the risk of gas eruption, but 
some researchers are worried that schemes to 
extract methane could make the situation more 
dangerous if they upset the lake's equilibrium. 
“It could be one of the great remediation 
projects of all time: mitigating a lethal natural 
hazard and at the same time bringing power to 
people who desperately need it,” says George 
Kling, a biogeochemist from the University of 
Michigan in Ann Arbor. “If it 
is done right?” 


An expanding problem 

Lake Kivu lies in the Great 

Rift Valley, where tectonic 

forces are slowly ripping 

Africa apart. That movement brings up molten 
rock, which releases carbon dioxide that seeps 
into the bottom of Lake Kivu. Bacteria convert 
some of the carbon dioxide into methane, and 
other bacteria produce methane by breaking 
down organic matter in the deep waters (see 
graphic, overleaf). 

Kivu is permanently stratified, with layers of 
dense salt-rich water below fresh water at the 
surface. Deeper than about 50-80 metres, the 
lake is anoxic and the concentrations of dis- 
solved carbon dioxide and methane increase 
with depth’. The differences in density prevent 
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the layers of water from vertically mixing, and 
so trap the gases at the bottom of the lake. 
Residents around the lake have known about 
the dissolved gases for many decades, but it 
wasn't generally thought to be a hazard. Then, 
in 1984, carbon dioxide erupted from Lake 
Monoun in Cameroon, killing 37 people. Two 
years later, another Cameroonian lake, Lake 
Nyos, spat up 0.3-1 cubic kilometres of carbon 
dioxide, asphyxiating more than 1,700 people. 
Kling was part of a team that visited Lake 
Nyos in the weeks following the eruption. 
“The animals were all dead, thousands of cat- 
tle just lying about,’ he says. Kling had been to 
Nyos the year before, but had 
only sampled surface waters. 
“We knew nothing about the 
gas bomb in the bottom of the 
lake,” he says. 
It turned out that the deep 
waters of Lake Nyos were nearly 
saturated with carbon dioxide and, like in Lake 
Kivu, the gases were kept in solution by the pres- 
sure of the overlying water. Kling postulates 
that a landslide disturbed the lake's stratifica- 
tion, forcing gas-rich waters to move upwards’. 
That started a chain reaction. The reduction in 
pressure caused carbon dioxide to come out of 
solution and form bubbles, much like what hap- 
pens when a bottle of champagne is uncorked. 
The rising bubbles dragged up the surrounding 
water, which also degassed, leading to a violent 
gas burst — a limnic eruption. 
Carbon dioxide is denser than air so, when 
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THE UNDERWATER THREAT 


If the dissolved gas accumulating in the depths of Lake Kivu bursts to 


the surface, it could threaten millions of lives. 
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it emerged, it hugged the ground, smothering 
everything as it spread up to 26 kilometres 
from the lake. The scale of the disaster com- 
pelled scientists to assess the risk at Kivu, the 
other lake known for its dissolved gases. 


Latent threat 

There are no historic records of limnic 
eruptions in Lake Kivu. But gaps in layers of 
plankton fossils at the bottom of the lake sug- 
gest that such paroxysms have struck several 
times in the past 5,000 years*. If Kivu were 
to undergo a limnic eruption soon, it would 
dwarf the Nyos disaster. Kivu is more than 
3,000 times larger and contains more than 
350 times as much gas as was released by Lake 
Nyos. Kivu’s shores are also densely populated. 
“Kivu is basically the nasty big brother of Nyos,” 
says Kling. But there is no scientific consensus 
on the current risks of that kind 
of a disaster. 

Gas concentrations measured 
in 1974 and a limited study con- 
ducted in 2004 (ref. 5) show that 
there has been a 15-20% increase 
in methane and a 10% increase 
in carbon dioxide levels in the lake in the past 
30 years, says Martin Schmid, a researcher 
at the Swiss Federal Institute of Aquatic Sci- 
ence and Technology in Kastanienbaum. If 
this trend continues, the lake will be saturated 
within the century and, like Lake Nyos, it could 
erupt with even the slightest disturbance. At 
the moment, however, the closest Kivu comes 
to gas overload is at a depth of 330 metres, 
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“I'm not afraid 
of swimming in 


Lake Kivu.” 
— Martin Schmid 
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where the water is 
55% saturated — 10% 
with carbon dioxide and 
45% with methane — says Schmid, so 

an eruption is less likely. (Methane contributes 
most of the gas pressure and the risk of eruption 
because it is less soluble than carbon dioxide.) 
According to modelling work, only an intense 
eruption in the gas-rich depths would be pow- 
erful enough to overturn the lake*. The 2002 
eruption barely affected its stability because 
the magma did not reach those depths’. “The 
probability is low,” says Schmid. “At least Iam 
not afraid of swimming in Lake Kivu.” 

But Dario Tedesco, a volcanologist from the 
Second University of Naples in Caserta, Italy, 
who is writing the DRC’s Mount Nyiragongo 
eruption contingency plan for the United 
Nations, says an eruption at the bottom of 
Lake Kivu isa possibility. Tedesco 
has studied bathymetric surveys 
of the lake and found cone-like 
structures that are probably vol- 
canic in origin, he says. 

That evidence matches some 
other signs. During the 2002 
Nyiragongo eruption, new fractures opened 
on the south side of the volcano, just a few 
kilometres from the lake. The lava from the 
fractures was also compositionally different 
from the volcano’ crater lake’, suggesting that 
there are separate reservoirs of magma in the 
region, some of which could extend under the 
lake. “Nyiragongo is going to erupt again,” says 
Tedesco. “The only real question is where.” 
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W Lake 


Tedesco’s research at Kivu highlights its 
complexity. The lake contains at least five 
basins, with different characteristics and dif- 
ferent probabilities of turnover’. The Kabuno 
basin, in the lake's northwest corner, has high 
gas concentrations only 12 metres below the 
surface. An eruption in Kabuno could release at 
least three times more gas than Lake Nyos did. 

Researchers agree that it is important to 
relieve gas pressure at Lake Kivu to avoid a 
natural disaster — and the economic incen- 
tives are pushing that work forwards. The 
60 cubic kilometres of methane equals roughly 
ten times the combined annual commercial 
energy needs of both the DRC and Rwanda. 
Tapping that reserve is particularly 
attractive to energy-starved Rwanda, 

and Lake Kivu has become the 

centrepiece of the country’s plans to 
expand electricity production. 

Extracting methane from Kivu 

is not a new idea. A brewery in 

Rwanda burned the lake’s methane 

to heat its boilers for 40 years before 
it shifted to electricity. But com- 
mercial interest in using methane to 
generate electricity only burgeoned in 
recent years, in part because of growing 
political stability in Rwanda. Around 60 com- 
panies, most of them from foreign countries, 
have approached the government for access to 
the lake since 2005, says Albert Butare, Rwan- 
da’s minister responsible for energy, water and 
sanitation. 

Rwanda has already handed out methane 
concessions totalling hundreds of megawatts 
to five consortia, including a US$325 million, 
100-megawatt deal with ContourGlobal, an 
energy company based in New York. And 
in June, Rwanda and the DRC announced 
a new joint plan to develop an additional 
200 megawatts from the lake. 

The growing interest in Kivu has kept 
scientists busy discussing best practices for 
methane removal. Most of the proposed models 
use a floating platform to suspend a vertical pipe 
down into the gas-rich layers. A small pump 
initially pulls up some of the bottom water to 
lower pressures, until it becomes saturated 
with gas and starts forming bubbles. After this 
priming, the bubbling drives water up the pipe 
without additional pumping. The extraction 
works like a controlled limnic eruption. 

The methane, being less soluble than the 
carbon dioxide, comes out of solution first. It 
is then piped onshore where it is burned to 
generate electricity. The problem is what to do 
with the carbon-dioxide-laden water. 

From the standpoint of safety, it would be 
ideal to extract the carbon dioxide and reinsert 
the degassed water into the deep parts of the 
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Close call: lava from Mount Nyiragongo didn't go deep enough into lake Kivu in January 2002 to set off a limnic eruption, but next time it could. 


lake, where it wouldn't disturb the equilibrium, 
says Kling. But removing the carbon dioxide 
makes the water lighter, hence less stable at 
depth. “We don’t want to generate any sink- 
ing or rising of water masses that will cause 
mixing,’ he says. Only water from the very 
bottom of the lake would still be dense enough 
to be reinserted in the deepest layers once it 
was degassed, but that would be prohibitively 
expensive. So Kling suggests keeping the car- 
bon dioxide in the water. “Every solution is a 
compromise,” he says. 


Conflict of interests? 

Kaus Tietze, a geophysicist and the director 
of Physik-Design-Technology, a consulting 
company based in Celle, Germany, argues that 
with carbon dioxide concentrations in the lake 
increasing at a rate of more than 3% a decade, 
“leaving the carbon dioxide in the lake is a very 
bad solution”. He is pushing for removing both 
methane and carbon dioxide 

as quickly as possible and 

reinserting the water above 

the main gas accumulations 

in the lake, so that it doesn’t 

dilute the methane resource. 

Schmid discounts that plan, 
however, because the nutri- 
ents in the degassed deep water would overload 
the lake’s upper layers. 

With so much disagreement among scientists, 
the best way forwards for developers is unclear, 
especially because the extraction technology has 
been problematic. So far only one 4-megawatt 
platform, called KP, is sporadically generating 
electricity for Rwanda’s national grid. Another, 
a3.6-megawatt project funded by the Rwandan 
Investment Group, sank last year a week before 
it was scheduled to begin production. Some 
have attributed the loss to sabotage but others 
have blamed bad engineering. 


Complicating the situation is the potential 
for conflicts of interest; some of the scientists 
who studied the lake and identified its hazard 
are now involved in the methane extraction 
projects. 

For example, Michel Halbwachs, a recently 
retired physicist from University of Savoie in 
France, has spent two decades studying Nyos 
and Kivu and now spearheads the Rwandan 
Investment Group's project. He says that the 
engineering of the sunken platform was sound 
and the company is now rebuilding it. Hal- 
bwachs says that his previous work on the lake 
is not a conflict of interest, but rather makes 
him uniquely knowledgeable about how to 
extract gases safely. 

Tietze agrees that only the few scientists 
who “know the whole lake” through decades 
of experience are qualified to run the methane 
projects. He is also trying to get into the extrac- 
tion game and is now looking for investors. 

But other scientists worry 
that the safety of the lake 
could be compromised if the 
researchers who are assessing 
the gas hazard are also work- 
ing on commercial methane 
extraction. The best interests 
of the lake's two million inhab- 

itants could get lost along the way, says Tedesco, 
who calls the plans for tapping methane “pure 
business”. “This has nothing to do with the 
hazard of the situation,’ he adds. 

“Tt’s a little bit like someone has dropped free 
money on the street and everyone's running 
around trying to gather it up,’ says Kling. He 
is part of an international team that is working 
with the World Bank and the involved nations 
to create rules for methane extraction in the 
lake. The team wants every project to be moni- 
tored through a local institute, which then 
reports to an international group of experts. 
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“There has to be a separation between who 
is doing the work and who is checking the 
work,” he says. In mid-June, the team came 
out with its first draft of legislation, which will 
now have to get parliamentary backing both in 
Rwanda and in the DRC. 

Although, in theory, removing the methane 
should make the lake safer, it remains unclear 
whether the hazard will be reduced. The plans 
for degassing are preliminary, and no one 
knows how they will affect the lake’s stability. 

The chances of avoiding a disaster depend on 
many factors, some well beyond the control of 
the scientists or even the governments in that 
region. Armed militias in the DRC recently 
took over four of the area’s seven seismic 
monitoring stations, hampering the ability of 
volcanologists to predict when Nyiragongo will 
erupt. And in the past few months, renewed 
fighting in rural areas has displaced hundreds 
of thousands of people. 

Goma’s population has almost tripled 
since the 2002 Nyiragongo eruption, to an 
estimated 1.2 million people. There are shan- 
ties crammed into every usable piece of land, 
many constructed from lava blocks from 
the volcano’s last eruption. So as scientists 
and developers fight over Lake Kivu’s meth- 
ane resource, the displaced people remain 
pinned between a volcano, the militias and an 
explosive lake. 

Anjali Nayar is an International Development 
Research Centre fellow at Nature. 
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Sharing different 
mouse strains is not 
always so simple 


SIR — | was disappointed by the 
view expressed in your Editorial 
‘The sharing principle’ (Nature 
459, 752; 2009) that the mouse 
community does not share its 
strains. This is untrue. Most labs 
are very collegial, spending a 
considerable amount of time and 
effort on distributing their mouse 
strains. Although there are a few 
labs that withhold distribution, 
any community may contain such 
individuals. The fact that a mouse 
strain is not found in a repository 
does not mean that it is not being 
shared. 

| was also puzzled by the 
conclusion of May's CASIMIR 
workshop, noted in the Editorial, 
that “the sharing problem urgently 
needs resolution” with regard 
to international mouse gene- 
knockout projects. Such mutant 
alleles will mostly be archived 
as embryonic stem-cell lines. 
Readers should also realize that 
repositories cannot keep all their 
mouse strains live ‘on the shelf’: 
most strains are frozen. The cost 
for a user to have a strain thawed 
is thousands of dollars and it 
takes many months before the 
recovered mice become available. 
This is a big disadvantage for labs 
on tight budgets. With regard 
to funding agencies: in grant 
proposals to the US National 
Institutes of Health, for example, 
applicants are required to write 
a ‘resource-sharing plan’ that 
includes genetically modified 
mice. 

It was suggested that sharing 
avoids duplication of effort. But 
it is essential that more than one 
group generates mutations in 
the same gene as acrosscheck. 
No two labs generate the same 
allele, and every geneticist knows 
that the expression of different 
alleles can lead to very distinct 
phenotypes. 

Your claim that sharing mice 
“has never been easier” is 
questionable, considering all the 
paperwork, health certificates, 
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veterinary screens, special 
serology screens, costs, time 
and logistics involved. This is 
quite different from uploading 
DNA sequences in the comfort 
of your office. 

It would be great if funding 
agencies supplemented grants 
involving the generation of mouse 
strains to cover the costs of 
sending the strains to a repository. 
In these tough financial times, that 
seems unlikely. 

Richard Behringer Department 
of Genetics, University of Texas, 
M. D. Anderson Cancer Center, 
1515 Holcombe Boulevard, 
Houston, Texas 77030, USA 
e-mail: rrb@mdanderson 


For Nature's policy on sharing 
of biological materials, see 
http://tinyurl.com/33fg2r 


Invasion biology is 
a discipline that's 
too young to die 


SIR — Mark Davis suggests in 
his book Invasion Biology that the 
discipline should be reabsorbed 
into general community ecology, 
as Emma Marris reports in her 
Review ‘The end of the invasion?’ 
(Nature 459, 327-328; 2009). In 
fact, invasion biology has amuch 
wider remit than the management 
of a few highly invasive species. 
The comparison of alien and 
native species has contributed 
to a better understanding of 
population dynamics, ecosystem 
function and species evolution. 
The field still has much to teach us 
about biodiversity and ecological 
processes in general, and its 
relevance to conservation policy 
is growing. 

The acceleration of international 
trade and travel has led to 
progressive mixing of biota from 
across the world, introducing 
increasing numbers of species 
to new regions. Although our 
knowledge of the drivers and 
consequences of these invasions 
is still poor, the wisdom of 
distinguishing species in relation 
to their geographical origin is 


confirmed by discoveries of the 
often marked difference between 
alien and native species in 
behaviour, traits and impact (see, 
for example, P. E. Hulme Global 
Ecol. Biogeogr. 17, 641-647; 2008). 
However, the position of invasion 
biology as a separate discipline 
will continue to be questioned 
until we have a better framework 
for understanding the impacts of 
invasive species, as well as for risk 
assessment and strategies to deal 
with them. 

Plurality of discourse is, of 
course, valuable in any science, 
and Davis's book should help 
to instil a balanced perspective 
among scholars in a young 
field that is still defining its own 
unique contribution to ecological 
understanding. 

Petr PySek Institute of Botany, 
Academy of Sciences of the Czech 
Republic, Zamek 1, 252 43 Prihonice, 
Czech Republic 

e-mail: pysek@ibot.cas.cz 

Philip E. Hulme Bio-Protection 
Research Centre, PO Box 84, Lincoln 
University 7647, Canterbury, 

New Zealand 


For cancer, seek 
and destroy or 
live and let live? 


SIR — In his Essay ‘A change of 
strategy in the war on cancer’ 
(Nature 459, 508-509; 2009), 
Robert Gatenby proposes that 
controlling a tumour's growth 
may be more efficient in the long 
term than trying to eradicate it. 
However, it could be some time 
before we can establish whether 
such a switch in treatment 
strategy is clinically beneficial. 

To keep cancer in check, Gatenby 
suggests giving progressively lower 
doses of cytotoxic drugs, rather 
than the maximum-tolerated doses 
used in conventional chemotherapy 
to try to destroy the tumour. 

In fact, oncologists have for 
several years been using daily low 
doses of drugs such as etoposide 
or cyclophosphamide to control 
tumours that do not respond to 
chemotherapy at higher doses, an 
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approach called metronomic 
therapy. Robert Kerbel and Judah 
Folkman and their colleagues 
discovered the potential of 
metronomic chemotherapy for 
restricting tumour blood-vessel 
growth some ten years ago (see, 
for example, G. Klement et al. 

J. Clin. Invest. 105, R15-R24; 2000) 
and Isaiah Fidler and Lee Ellis have 
proposed that cancer is a chronic 
disease and should be treated as 
such (I. J. Fidler and L. M. Ellis 
Nature Med. 6, 500-502; 2000). 

But despite some promising 
results in adult and paediatric 
oncology, this approach remains 
marginal and limited to patients 
in relapse, or to those for whom 
conventional treatment has 
failed. Perhaps this is because a 
metronomic strategy challenges 
the predicted success of 
conventional regimens based 
on dose intensity. 

Paradoxically, many oncologists 
have adopted metronomic 
scheduling with blockbuster drugs 
such as Gleevec, Tarceva and 
Sutent — targeted therapies 
administered daily at low doses — 
even though no one really knows 
whether a classical regimen of 
higher doses every three weeks 
might prove more efficient. 

Gatenby's idea of ‘adaptative 
therapy’, derived largely from 
mathematical modelling, is a tour 
de force. He may be paving the 
way for advocates of traditional 
chemotherapy, targeted therapies 
and metronomic strategies to 
work together in the patient's 
best interest. We hope that 
stakeholders will be prepared to 
test Gatenby’s approach and to 
revisit the idea of treating cancer 
as achronic disease. 

Nicolas André Department of 
Paediatric Oncology, H6pital pour 
Enfants de la Timone, Bd Jean Moulin, 
13385 Marseille Cedex 5, France 
e-mail: nicolas.andre@ap-hm.fr 

Eddy Pasquier Children’s Cancer 
Institute Australia for Medical 
Research, Randwick, 

New South Wales 2031, Australia 


Readers are welcome to 
comment on this topic at 
http://tinyurl.comM4jn45 
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Why we need space travel 


Giovanni Bignami reflects on the people who persuaded him that we must send humans 
beyond Earth's orbit to inspire public and political support for science. 


The seasoned army officer slapped me on the 
back. “So it’s scientists like you who made this 
possible,” he thundered. “And we always called 
you sissies.” 

We were in the officers’ club, a bare room 
with a lousy television set, in a small bar- 
racks in the Italian Alps. It was the early 
hours of 21 July 1969. This captain of 
the Alpini — an elite Italian mountain 
infantry corps — and I were watching 
the fuzzy, black and white images of 
the Apollo 11 Moon landing thanks 
to rickety reception from an ice-axe 
doubling as an antenna on the roof. 

I was a 25-year-old conscript. 
Id been made a junior officer on 
the strength of my physics degree, 
despite my political leanings being 
dangerously in tune with the 
1968 cultural revolution. 

A few months later I was back 
at the University of Milan, ready 
to serve under a much more chal- 
lenging commander: Giuseppe 
Occhialini, a baron of the academic 
community, and a demigod for me. 
He had just put himself in charge of 
Italian science in space after 30 years 
in fundamental particle physics. 

Occhialini gave me three curt instructions: 
I was to go and work on a space astronomy 
project somewhere in the fogs of northern 
Europe. I was to note that all the work would 
take place in English (of which I could remem- 
ber little). I was above all to remember a crucial 
truth — that science in space should be real 
science, not the semi-militaristic mucking 
around with astronauts we'd recently seen 
broadcast around the world. 

The last order was one of 
the few mistakes the great 
man made. But it took me 
decades to work that out. 

Meanwhile, the Vietnam 
War was surely and swiftly 
terminating the Apollo Programme. Few real- 
ize today that NASA’s funding of Apollo, its 
biggest expenditure ever, overlapped with the 
United States undertaking its greatest mili- 
tary effort since the Second World War. After 
putting men on the Moon, Apollo hada futile, 
détente-fuelled fling with the Russian Soyuz 
human-spaceflight programme. Between 1972 
and 1975, NASA unwisely got rid of its few 


“Without astronauts and 
their cheering crowds, 
space science will quickly 
wither into oblivion." 


remaining Saturn V launch rockets developed 
by Werhner Von Braun, an Americanized Ger- 
man physicist. At the time, they were the most 
powerful machines ever built and, alas, we have 
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to re-invent them today. Six long 

years passed, until the shuttle era began 

in 1981, before the American conquerors of the 
Moon had access to space again. 

NASA's backpedalling only confirmed 
my conviction that Occhialini was right: the 
agency seemed to have wasted more than 
a decade and an astronomical amount of 
money. And the situation, 
apparently, was only get- 
ting worse. The shuttle had 
nothing to do with science. 
It was designed to deliver 
materials and astronauts 
into space needed to build 
the Space Station — a Reaganite invention 
designed to crush the ‘evil empire’ of the Soviet 
Union technologically, but disingenuously 
marketed as a means to do science. 

Throughout the 1980s and early 1990s, 
it was a la mode in Europe to join US scien- 
tists in shouting that the planned station was 
robbing us of precious space science. I joined in 
wholeheartedly — but in the late 1980s began 
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to have doubts. My misgivings were triggered 
by the advice of a learned friend, whom I won't 
name in deference to his modesty. 

This man was my mentor when I started 
working as an adviser for the European Space 

Agency in 1984. He instilled in me the idea that 
governments — ofall stripes — will never 
part with the huge sums space demands 
for just ‘doing science. He helped me 
grasp that the funding of space science, 
such as the building of satellites for 
astronomy, happens because projects 
such as the station and its astronauts 
are easy for politicians to understand, 
and hugely attractive to industry. So if the 
whole lumbering station-plus-astronauts 
machine was kept rolling, then space sci- 
ence could ride smoothly on its back, 
more or less unnoticed. And it did 
so very well for two decades. 
As the millennium turned, I 
saw that the already ageing Inter- 
national Space Station was not 
enough. You cart sell the future, 
especially in a time of crisis, on 
something that is decades old. You 
need a new mission to spark enthusi- 
asm — such as sending people to Mars. 

Which brings us to today. The Alpini 
captain was right: it is just ordinary 

people doing science that drive progress. 
Occhialini was wrong to pit science against 
astronauts. Forty years after the first Moon 
landing, it is clear that only a great new pro- 
gramme of space exploration, robotic and 
manned, will carry us forwards. It is the only 
way to keep improving on the beautiful tech- 
nologies we've swiftly turned into services, from 
telecommunications to Global Positioning 
System navigation satellites. 

Without astronauts and their cheering 
crowds to rally us along, space applications 
and space science will quickly wither into 
oblivion. Forty years from now there will 
be no space activities at all if we do not send 
people beyond Earth's orbit, where no one has 
been since 1972. a 
Giovanni Bignami is at the Institute for Advanced 
Study of Pavia (IUSS), Lungo Ticino Sforza 56, 
27100 Pavia, Italy, and Accademia dei Lincei, 
Rome, Italy. He was chairman of the ESA Space 
Science Advisory Committee from 2003 to 2006. 
e-mail: giovanni.bignami@gmail.com 
See also www.nature.com/Apollo. 
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The slow slide towards a new battlefield? 


The cold war saw governments develop international policies to regulate outer space for military and civil 
uses. Loopholes in those policies must now be closed, writes Roald Sagdeev. 


Satellites form an integral part of global infra- 
structure through their use in telecommuni- 
cations, remote sensing and navigation. They 
have also strengthened international secu- 
rity. Indeed, as James Clay Moltz explores in 
his book The Politics of Space Security, space 
regulation has been shaped by defence inter- 
ests ever since the Soviet launch of Sputnik 1 
in 1957. 

Sputnik did not carry any military stigma; 
but, as the first artificial object in space, it 
opened up a potential new battlefield. The 
US administration understood the dangers 
immediately. In 1958, US President Dwight D. 
Eisenhower wrote to Soviet Premier Nikita 
Khrushchev to advocate strategic restraint and 
the right of any country to free access to space 
as part of the common heritage of humankind. 
Yet such advocacy did not exclude the military 
from space altogether; after all, satellite recon- 
naissance from orbit could reduce the risk of a 
surprise attack. 

It is remarkable how the leaders of that 
dramatic epoch chose negotiation and treaties 
over a display of brute force in space. Such a 
spirit was expressed in the National Aeronautics 
and Space Act legislated by US Congress in 1958: 
“activities in space should be devoted to peaceful 
purposes for the benefit of all mankind”. 

Ina decisive step in 1959, the United Nations 
established the Committee on the Peaceful Uses 
of Outer Space to designate the legal framework 
for space-related activities. Ina treaty signed in 
1963, the United States, the United Kingdom 
and the Soviet Union agreed to ban nuclear- 
weapons tests in the atmosphere, underwater 
and in space. The UN’s comprehensive Outer 
Space Treaty, which prohibited the space 
deployment of weapons of mass destruc- 
tion, was adopted in 1967. Yet this treaty left 
open the loophole of placing conventional 
armaments in orbit, 
allowing both super- 
powers to develop 
technologies for anti- 
satellite weapons. 

Moltz dissects the 
debates and battles 
that have taken place 
between those who have formulated space 
policy. He describes, for instance, the episode 
when US President John F. Kennedy got the 
1963 Partial Test Ban Treaty approved by Con- 
gress despite strong opposition — including 
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Will President Barack Obama fulfil his campaign promise to tackle space debris, illustrated here? 


that of physicist Edward Teller. In the seem- 
ingly better times after the cold war, President 
Bill Clinton failed to get the US Senate to ratify 
the Comprehensive Test Ban Treaty, which 
would ban all nuclear tests. 

The peak of success in regulating military 
activity in space was reached in 1972 at 
the signing of the 
Antiballistic Missile 
Treaty by the United 
States and the Soviet 
Union. It was patiently 
championed by the 
US administrations of 
both Lyndon B. John- 
son and Richard Nixon until the Soviet Union 
was persuaded by arguments of strategic sta- 
bility. The treaty prohibited the development 
and deployment of missile defences — such as 
the ‘Star Wars shield’ — in orbit. The regime 
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of deterrence was strengthened and space was 
left militarized, yes, but free of any weapons. 
The treaty even survived Ronald Reagan's later 
obsession with the Strategic Defense Initiative 
for protecting the United States from ballistic 
nuclear attack. 

Steady technological progress made navi- 
gation from space possible, Moltz explains. 
Signals sent by Global Positioning System 
(GPS) satellites, operated by the US Air Force, 
have been freely available since 1995. That ful- 
filled the promise Reagan made in 1983 after a 
Korean Air flight was shot down by the Soviet 
Air Force in an incident allegedly caused by 
navigation error. For the military, GPS pro- 
vides greater navigational accuracy for tar- 
geting weapons; hence sci-fi writer Arthur C. 
Clarke's description of the first Iraqi campaign 
as “the first space war’. 

With George W. Bush’s 2001 arrival in 
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the White House, the mix of technological 
exuberance and triumphalism, born of the 
United States’ superiority in space, started to 
challenge the old paradigm of space security 
based on an international legal regime. The first 
signal of change came in January 2001 froma 
US commission on space security, chaired by 
Donald Rumsfeld before he became defence 
secretary. Warning that the United States could 
one day face a “space Pearl Harbor” — a devas- 
tating surprise attack on its strategically impor- 
tant space assets — the commission members 
wanted to ensure “that the President will have 
the option to deploy weapons in space to deter 
threats to, and if necessary, defend against 
attacks on US interests”. 

Rumsfeld presided over several Pentagon 
activities that prepared arguments for the 
full-spectrum use of force in space. Thus, the 
Air Force Space Command described in its 
2004 Strategic Master Plan the possibility of 
using anti-satellite weapons, or even weap- 
ons that could be deployed in space to target 
objects on Earth. Most controversial was the 
US withdrawal in 2001 from the international 
Antiballistic Missile Treaty, which had by then 
been in force for three decades. 

The existing set of space laws has loopholes. 
It does not restrict conventional weapons in 
space that could destroy or damage orbiting 
satellites. Satellites are extremely vulnerable 
even to primitive attacks, such as being hit at 
high speed by a projectile, and anti-satellite 
weapons exploit this fragility. Such weapons 
were not added to military arsenals during the 
cold war. However, both sides have kept open 
the possibility of developing these weapons to 
gain superiority, while each fearing that the 
other will overtake technologically. 

To close these loopholes, experts are calling 
for the introduction of ‘rules of the road’ for 
orbital traffic, the prohibition of all weapons 
in space, the testing and deployment of anti- 
satellite weapons on the ground and support 
for international arrangements to mitigate 
the growing risk from space debris to assets 
in orbit. 

China and Russia, for example, have tried 
during the past few years to introduce an initia- 
tive to ban space weapons in the UN Confer- 
ence on Disarmament. The United States, with 
almost a lone voice, has blocked this discus- 
sion. China’s test of an anti-satellite weapon in 
2007 increased the pressure to act, and demon- 
strated that not everyone is ready to accept the 
space dominance of one country. 

The collision in February this year of an 
active US telecommunications satellite from 
the Iridium constellation with a long-dead Rus- 
sian one adds an alarm call. Introducing regu- 
lations that require satellites to be de-orbited 


at the end of their active life would need inter- 
national cooperation and exchange of infor- 
mation on orbital traffic and debris. As with 
military threats, the United States has most to 
lose in an unregulated environment because it 
has the greatest presence in space. 

A regulatory regime for orbiting objects can 
be introduced in various ways. An interna- 
tional law to ban space weapons, adopted by 
the UN, would offer a radical solution. In the 
interim, a regime of restraint could be agreed 
among those countries that are active in space. 
A precedent is the Missile Technology Control 
Regime — a voluntary association of 34 coun- 
tries that has been promoted by the United 
States and others since the end of the cold war. 
Governments could also pledge not to place 
weapons in orbit. 

Moltz ends on a pessimistic note: the long 
and difficult political battle to come. Pro- 
ponents of space-based weaponry point to 
risks from North Korea, a nation seemingly 
embracing both nuclear weapons and seek- 
ing orbital capability. Iran’s nuclear and rocket 
development is cited as another potential 
threat. However, the debate is changing thanks 
to the rapid rise of civilian space applications. 


Satellite networks for telecommunications, 
remote sensing and navigation are effectively 
global utilities. Space exploration today is also 
largely carried out by international collabo- 
ration, whether the mission is to Mars or the 
Moon. The presence of the International Space 
Station, which brought together the former 
adversaries of the cold war, itself safeguards 
against space becoming a battleground. 

The rise of Barack Obama to the highest 
political orbit brings new hope. Promises he 
made during his presidential campaign include 
pursuing international negotiations to mini- 
mize space debris and banning the weaponiza- 
tion of space. He has declared that “the United 
States must show leadership by engaging other 
nations in discussions of how best to stop the 
slow slide towards a new battlefield”. Such lead- 
ership is urgently needed. a 
Roald Sagdeev is a Distinguished University 
Professor of physics and director of the East West 
Science Center at the University of Maryland, 
College Park, Maryland 20742, USA. He was 
director of the Russian Space Research Institute 
(1973-88) and science adviser to Mikhail 
Gorbachev (1985-88). 
e-mail: rsagdeev@gmail.com 


The return path to the Moon 


The Seventh Landing: Going Back to the 
Moon, This Time to Stay 

by Michael Carroll 

Springer: 2009. 115 pp. £19.99, $29.95 


In The Seventh Landing, Michael Carroll 
adeptly lays out NASAs strategy for returning 
to the Moon, establishing a permanent out- 
post there, and, if all goes well, pushing on to 
Mars. It is a complex and ambitious undertak- 
ing, requiring a new fleet of launch vehicles, 
an updated crew module, a more versatile 
lunar lander, sophisticated all-terrain vehi- 
cles, souped-up spacesuits, lunar habitats and 
enough acronyms to make your head explode. 
Fortunately, using straightforward reportage, 
coupled with his own stunning space art, 
Carroll brings compelling order to what could 
have been a chaotic romp. 

NASAs goal is to get moon boots on lunar 
soil by the year 2020. But to do so, milestones 
must be passed on schedule. The maiden flight 
of the booster rocket, Ares I, which will loft the 
crew module, Orion, into orbit, is scheduled for 
early 2013. Orion, which will accommodate a 
crew of up to six, may fly its first manned mis- 
sion to the International Space Station by 2015. 
The more powerful Ares V booster will carry 
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the lunar lander, named Altair, into Earth orbit. 
After docking with the crew module, the boost- 
er’s upper stage will haul both Orion and Altair 
to the Moon. Ares V’s maiden flight is sched- 
uled for 2018. Deadlines, hardware and mission 
details are in flux but, Carroll recognizes, “the 
overarching goals, strategies and inspiration for 
the seventh landing will not change”. 

Those sentiments, however, may not go 
unsullied. Even now, under a directive by US 
President Barack Obama, a committee of ten 
aerospace executives, astronauts, engineers, 
scientists and a retired general are taking a 
down-to-earth look at the United States’ space 
exploration strategies (Nature 459, 1038-1039; 
2009). Particularly sobering are the budget 
constraints on completing both the Orion 
module and the Ares V booster. Reservations 
from scientists about the value of a return to 
the Moon have also reached the committee's 
ears. Their report, which chairman Norman 
Augustine asserts will be unblinking in its find- 
ings, will be released in August. 

After looking back at early robotic lunar 
missions such as the Luna and Ranger series 
and then the Apollo programme, Carroll shifts 
the focus from our lunar history to our poten- 
tial lunar future and the justification for that 


327 


OPINION 


NATURE|Vol 460|16 July 2009 


future. Whereas the Mercury, Gemini 
and Apollo programmes were politi- 
cally motivated, the reasons to return 
are more practical. Jim Garvin, chief 
scientist at NASA’s Goddard Space 
Flight Center in Greenbelt, Mary- 
land, gives the most succinct reason: 
“Going to the Moon enables us to 
build a platform for technology that 
is transformational to our society, 
and that’s not just Teflon.” 

As Carroll’s interviews with 
experts show, those of us who 
remain on Earth will probably reap 
the benefits from innovations that 
emerge from the new Moon pro- 
gramme. These include advances 
in communications, materials sci- 
ence, computer technology, robot- 
ics, medical procedures and even 
building materials. 

There are also purely scientific rea- 
sons for returning to the Moon. It is, 
as Carroll writes, “the Rosetta stone 
of the terrestrial planets”, and there 


Carroll's vision of a shuttle craft coming in to land at a Mars settlement. 


reach a favourable alignment for the 
journey back. 

Planetary scientist Chris McKay 
of NASA’s Ames Research Center 
in Moffett Field, California, thinks 
our approach should resemble the 
efforts made to live and work in Ant- 
arctica. “We've been [in Antarctica] 
for 50 years and built a station that’s 
designed to last at least another 30. 
That’s the kind of mentality I would 
like infused into the Moon pro- 
gramme ... and Mars base:” 

Carroll writes passionately that 
missions to the Moon and Mars are 
like instinctual siren calls to humans. 
“It is something larger than our- 
selves, something for the generations 
to come.’ To deny these instincts, he 
argues, is to deny our history and to 
deny our humanity. 

Having won the race to the Moon 
with Apollo 11 in 1969, US public 
interest in pushing on to Mars 
quickly waned. The Seventh Landing 


is still much to be learned about its 

geology and origins. If ice is locked up in the 
permanently shadowed regolith at the north or 
south poles, as many scientists think, it probably 
got there via countless impacting comets. Such 
material would be pristine, deposited when the 
Solar System was in its infancy. A study of these 
volatiles would prove invaluable to understand- 
ing the origins of the Solar System. 

Inherent in the strategy is the long-range 
vision of sending humans to work and live on 
Mars. If we can learn to abide ina lunar envi- 
ronment, we should be able to deal with the 
harsh conditions on the Red Planet. As Carroll 
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describes, however, many in the aerospace 
industry ask: why waste time and money on 
the Moon when those resources could be better 
used for a mission to Mars? This debate will not 
be settled for some time. 

What is not up for debate is the difficulty 
of establishing a long-term base on Mars. Just 
getting humans there will be a challenge. Travel 
times will be up to eight months, and by the 
time a crew lands the launch window to return 
home will practically be gone. Even ifa crew 
stays for only a few weeks, they would have to 
wait another year and a half for the planets to 


The anniversary of the first footstep 
on the Moon is being celebrated in 
an array of new books. Drawing on 
NASA‘ archive of oral histories, and 
declassified Central Intelligence 
Agency material on the space race, 
historian Craig Nelson tells in Rocket 
Men (Viking) the full story of the 
Apollo programme from the crews’ 
training to the first moonwalkers’ 
return as media stars. 

Inhis straight-talking memoir 
Magnificent Desolation (Harmony), 
co-written with author Ken 
Abraham, Buzz Aldrin describes 
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the great journey. More than 

600 million people watched Aldrin 
become the second man to set foot 
on the Moon. But on his return to 
Earth he spiralled into depression 
and alcoholism. 

Other Apollo astronauts describe 
their lunar experiences in Andrew 
Chaikin's Voices From the Moon 
(Viking Studio) and select their 
favourite photographs from the 
mission in Apollo: Through the Eyes 
of the Astronauts (Abrams), edited 
by Robert Jacobs and others. 

NASA‘s official history, Apollo 
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demonstrates that it has never died, 
however. Carroll's enthusiasm is infectious and 
will inspire readers who look back on the the 
Apollo landings as a time when mankind really 
did make giant leaps towards a promising 
future. Maybe we will finish what we started 
40 years ago and set a course for Mars, either 
directly, or by way of the Moon. After all, that 
was the idea in the first place. a 
Jeff Kanipe is the author of Cosmic Connection: 
How Astronomical Events Impact Life on Earth. He 
lives in the Washington DC region. 
e-mail: jeff.kanipe@comcast.net 
For more on Apollo, see www.nature.com/Apollo. 


A shelf of testimonies from the men who walked the walk. 


Expeditions to the Moon, edited by 
Edgar Cortright, is being reissued 
by Dover. It will include first-hand 


accounts from those involved in 
all areas of the mission anda DVD 
with historic space footage. 
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OPINION 


In Retrospect: Calvino’s Cosmicomics 


The soaring imagination of the Italian author abounds in a new compilation of his cosmic fables. 
Mostly written in the age of the space race, they are heavily informed by science, finds Alan Lightman. 


In one of the stories in Italo Calvino’s The 
Complete Cosmicomics, the inimitable char- 
acter named Qfwfgq is gambling with Dean (k) 
yK in the nothingness before time and space 
began. “I don’t want to boast,” announces 
Qfwfq, “but from the start I was willing to bet 
there was going to be a universe, and I hit the 
nail on the head ... We were always gambling, 
the Dean and I, because there was really noth- 
ing else to do, and also because the only proof 
I existed was that I bet with him, and the only 
proof he existed was that he bet with me” 

Wildly inventive, philosophical and play- 
ful at once, addicted to sustained metaphors, 
Calvino is almost unique in his fictions. One 
might compare him to José Saramago or Jorge 
Luis Borges, but few others. In Calvino’s daz- 
zling Invisible Cities, the explorer Marco Polo 
reports to Genghis Khan about each of the 
marvellous cities in the Khan’s empire, the 
descriptions exquisite in their crystalline prose. 
In The Baron in the Trees, a man lives among 
the leaves and branches of trees, viewing the 
world from far above the ground. 

The Complete Cosmicomics combines several 
earlier works: Cosmicomics (1968) and Time 
and the Hunter (1969), both translated by 
William Weaver; four stories from Numbers 
in the Dark (1995), translated by Tim Parks; 
and seven tales from World Memory and Other 
Cosmicomic Stories (1968), newly translated by 
Martin McLaughlin. 

Most of these stories were written between 
1963 and 1968, in the heat of the space race 
between the United States and the Soviet 
Union. They are heavily informed by sci- 
ence, especially astronomy and cosmology 
but also evolution and cell biology. In one of 
the yarns, for exam- 
ple, the protagonist 
is an ageing dinosaur 
who has somehow 
survived the extinc- 
tion of his species 
and hides out, like a 
war criminal, in a tribe of descendant reptil- 
ians called “New Ones. The interloper is, by 
turns, embarrassed, secretive, defensive and 
finally indignant about his origins, falling in 
love and then being rejected by one of the new 
females. In another tale, two men and a woman 
fall through empty space for untold centuries, 
the narrator attempting to steer his trajectory 
closer to the woman and imagining a rivalry 


by Italo Calvino 


The Complete Cosmicomics 


Translated by William Weaver, Tim Parks 
and Martin McLaughlin 
Penguin Classics: 2009. 304 pp. £20 


CALVINO 


LHESOOMPLE TE 
COSMICOMICS 


Colloquialisms nestle happily with scientific 
terminology in Calvino's playful approach. 


with the other man, a moustachioed lieutenant. 
In another, the main character is a mollusc: “I 
try to persuade myself that the two holes I had 
were a mouth and an anus ... Every now and 
then, I was seized by fantasies, that’s true, for 
example, the notion of scratching my armpit, 
or crossing my legs, or once even growing a 
moustache.” 

Many of the stories concern cosmic beings 
who offer witty commentary on the formation 
of the Universe and have the wondrous capac- 
ity to casually wander 
across time and space. 
They exist in the cush- 
iony folds of cosmic 
nebulae, or in empty 
space, or falling in the 
gravitational pull of 
distant galaxies. Calvino makes no attempt to 
be logically consistent or scientifically plausi- 
ble, and herein arises much of the humour and 
charm of the book. Molluscs imagine armpits, 
cold exists before the Universe existed, people 
climb ladders and jump on to the Moon. 

In ‘At Daybreak’ a squabbling family stum- 
bles across “pinnacles, spires, and battlements” 
of condensing gas in a new-formed nebula. In 
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‘As Long as the Sun Lasts, Qfwfq’s grandparents 
trade insults about which stars to set up camp 
near: “And why not the other one? The bigger 
they are, the more I trust them” “Are you mad, 
don't you know what that is? Don’t you know 
about the blue ones? They burn so fast, you 
don’t even notice, and barely a couple of thou- 
sand millennia go by and you've already got to 
start packing” Colloquialisms nestle happily 
with scientific terminology. 

We never know who these cosmic supermen 
are, where they came from, or how they got to 
be where they are. In Cosmicomics, there are no 
houses and chairs, no roads or cars, no banks 
or telephones, no Berlin or London. There is no 
human civilization at all. Yet we do meet intel- 
ligent beings, even families and lovers, attempt- 
ing to make sense of life in its most elemental 
form. It seems that Calvino is attempting to 
fathom how much ofa reality can be created 
without familiar geography of time and space. 
He plays with science the way a found-object 
artist throws together bits of silvered glass, odd 
metal brackets, bits of coloured paper. 

It is impossible not to compare Calvino to 
another of his Italian countrymen, Primo Levi. 
Both wrote imaginative fiction inspired by sci- 
ence. Calvinos Complete Cosmicomics is more 
fanciful and playful than Levi's Periodic Table. 
Levi's characters in the latter are far richer. 
The most interesting of Calvino’ fables, in my 
opinion, are the ones with human drama. The 
least interesting are those in which the author 
goes into long pedagogical riffs on science or 
indulges in pages of philosophical reflection. 
When Calvino brings together human drama 
with his imaginative scenarios, and philosophy 
with his beautiful language, humour and wit, 
he is unparalleled. 

In the story titled “World Memory, an 
organization is entrusted with the job of stor- 
ing all of the world’s knowledge and culture in 
the face of an impending cataclysm. “The duty 
of the director is to make sure that nothing is 
left out, because what is left out is as if it had 
never been.” Fortunately for future generations 
of world civilization, Calvino has been, and he 
will certainly not be left out. a 
Alan Lightman is a physicist, writer and adjunct 
professor of humanities at the Massachusetts 
Institute of Technology, Cambridge, 
Massachusetts 02139, USA. His forthcoming 
book, the narrative poem Song of Two Worlds, will 
be published in October 2009. 
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AGEING 


A midlife longevity drug? 


Matt Kaeberlein and Brian K. Kennedy 


The small molecule rapamycin, already approved for clinical use for various human disorders, has been 
found to significantly increase lifespan in mice. Is this a step towards an anti-ageing drug for people? 


Anti-ageing drugs — compounds that slow 
the hands of time and allow humans to live far 
beyond their natural span — have long been 
fertile ground for science-fiction writers. More 
recently, however, the possibility that such com- 
pounds might exist, and might perhaps even be 
within reach, has gained scientific credibility. In 
this issue (page 392), Harrison et al.' provide 
evidence that pharmacological intervention 
in the ageing process is feasible in mammals”. 
They report that dietary supplementation with 
rapamycin — a compound known to be linked 
to lifespan in invertebrates — significantly 
increases the lifespan of mice. 

The US National Institute on Aging’s Inter- 
ventions Testing Program (ITP) was designed 
to test compounds of interest for effects on 
ageing in mice’. Anyone from the scientific 
community can nominate a compound for 
consideration by the ITP, and selected com- 
pounds are tested in parallel longevity studies 
at laboratories at three sites, providing built-in 
triplicate replication and high statistical power. 
Several compounds have already been tested. 
Of these, rapamycin is the first to robustly 
increase lifespan across all three centres and 
in both male and female mice. 

As is often the case in science, this study 
benefited from a fortuitous accident. Early on, 
the ITP researchers realized that simply adding 
rapamycin to feed failed to maintain high levels 
of the drug, so a specially formulated feed was 
developed in which rapamycin is encapsulated 
for timed release in the intestine. It took more 
than a year to develop the special feed, which 
meant that mice in the first cohort to receive 
rapamycin were 600 days old when supplemen- 
tation was initiated. As Harrison et al.’ note, 
this is “roughly the equivalent of a 60-year- 
old person”. Amazingly, both the median and 
maximum lifespan of these middle-aged mice 
were significantly increased by rapamycin 
supplementation. For instance, rapamycin 
increased maximum lifespan (defined by the 
90th survival percentile) from 1,094 days to 
1,245 days for female mice and from 1,078 days 
to 1,179 days for male mice. This translates into 
a striking increase in life expectancy at the time 


*This article and the paper concerned! were published online 
on 8 July 2009. 


Figure 1| Rapa Nui and rapamycin. Rapamycin, the compound shown by Harrison et al.' to prolong 
lifespan in mice, was first identified in soil samples taken on Easter Island, the Polynesian island famed 
for its rock-carved figures. Rapamycin compounds are used clinically as immunosuppressants for organ 
transplants, as a treatment for advanced kidney cancer, and to prevent narrowing of coronary arteries 
after angioplasty. Easter Island is also known as Rapa Nui, from where rapamycin gets its name. 


rapamycin supplementation was initiated of 
about 38% for females and 28% for males. In 
an ongoing study, mice are being fed rapamy- 
cin beginning at 270 days, with a significant 
increase in survival also being apparent in this 
cohort. 

Rapamycin was first identified as a natural 
product of the bacterium Streptomyces hygro- 
scopicus in soil samples from Easter Island — 
famous for its impressive rock-carved human 
statues (Fig. 1). The compound was selected for 
inclusion in the ITP on the basis of its known 
property as an inhibitor of the kinase enzyme, 
target of rapamycin (TOR). TOR signalling has 
previously been linked to the ageing process in 
invertebrates” °, but until now it had remained 
an open question as to whether TOR signalling 
also has a central role in mammalian ageing. 
The findings of Harrison et al.’ make TOR the 
first protein that has been shown to modulate 
lifespan in each of the four model organisms 
most commonly used to study ageing: yeast, 
worms, flies and mice’. 

How does TOR activity influence ageing? 
Among other functions, TOR promotes 
translation of messenger RNA into protein 
by the ribosome, and inhibits a pathway that 
degrades cellular products in lysosomal vesicles 
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(autophagy) — both of which have been impli- 
cated in ageing in invertebrate species’. Regula- 
tion of mRNA translation by TOR, in particular, 
has emerged as a lifespan-determining pathway 
that is highly conserved between yeast and the 
nematode worm Caenorhabditis elegans. In 
both species, mutations in targets of TOR, such 
as ribosomal S6 kinase (an enzyme involved in 
protein translation), several translation-initia- 
tion factors and multiple ribosomal proteins, 
increase lifespan®. In addition, TOR influences 
cell growth, cell-cycle progression, mitochon- 
drial metabolism and insulin-like signalling. 
Untangling the relative contributions of each of 
these processes to the lifespan extension in mice 
conferred by rapamycin is likely to stimulate 
much interest during the next few years. 

TOR signalling has also received attention 
for its role as a possible mediator of dietary 
restriction. This is defined as a reduction in 
nutrient availability without malnutrition, and 
has long been known to increase lifespan in spe- 
cies ranging from yeast to rodents’. TOR activ- 
ity is reduced by dietary restriction, and genetic 
studies in invertebrate models have linked the 
inhibition of TOR to increased longevity by 
dietary restriction’. For example, a recent study 
in yeast showed that TOR inhibition increases 
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the amounts ofa nutritionally responsive tran- 
scriptional activator Gcn4, and demonstrated 
that this is required for full lifespan extension 
from dietary restriction’. Similarly, autophagy 
must be induced for lifespan to be extended by 
dietary restriction in C. elegans"”. 

On the basis of these studies, it is tempting to 
speculate that rapamycin may be functioning as 
a dietary-restriction mimetic — a small mole- 
cule that provides the benefits of dietary restric- 
tion without requiring a reduction in food 
intake. Like dietary restriction, TOR inhibi- 
tion not only increases lifespan, but also confers 
protection in invertebrate and rodent models 
against age-associated disorders, including car- 
diovascular dysfunction, diet-induced obesity 
and cancer’. Cancer inhibition in particular is 
a hallmark of dietary restriction in rodents, and 
rapamycin analogues are already used clinically 
as a treatment for certain forms of cancer. 

Despite these links, Harrison et al.‘ do not 
strongly favour the idea that rapamycin is mim- 
icking dietary restriction in mice. This is based 
on their data that rapamycin extends lifespan 
without reducing body weight, and when treat- 
ment is initiated during middle age (late-life 
onset of dietary restriction has shown incon- 
sistent effects on longevity in previous studies). 
It is worth pointing out, however, that a true 
dietary-restriction mimetic may not reduce 
body weight if it mimics the signalling events 
(and downstream responses) associated with 
dietary restriction without changing food con- 
sumption. Also, dietary restriction has not yet 
been extensively characterized in mice of the 
genetically diverse background used by Har- 
rison et al., so it is difficult to predict whether 
dietary restriction in these animals would have 
effects similar to rapamycin. Thus, although it 
is premature to say for certain that rapamycin 
is functioning as a dietary-restriction mimetic 
in mice, the known role of TOR in the nutrient 
response, and the genetic relationship between 
TOR signalling and dietary restriction in inver- 
tebrates, make this a reasonable possibility. 

Is this the first step towards an anti-ageing 
drug for people? Certainly, healthy individuals 
should not consider taking rapamycin to slow 
ageing — the potential immunosuppressive 
effects of this compound alone are sufficient 
to caution against this. On the basis of animal 
models, however, it is interesting to consider 
that rapamycin — or more sophisticated strate- 
gies to inhibit TOR signalling — might prove 
useful in combating many age-associated dis- 
orders. Also, as relevant downstream targets of 
TOR are better characterized, it may be possi- 
ble to develop pharmacological strategies that 
provide the health and longevity benefits with- 
out unwanted side effects. So, although extend- 
ing human lifespan with a pill remains the 
purview of science-fiction writers for now, the 
results of Harrison et al.' provide a reason for 
optimism that, even during middle age, there's 
still time to change the road you're on. o 
Matt Kaeberlein and Brian K. Kennedy are in the 
Departments of Pathology and Biochemistry, 
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ATMOSPHERIC PHYSICS 


Cosmic rays, clouds and climate 


Ken Carslaw 


Galactic cosmic rays could influence Earth's cloudiness by creating aerosol 
particles that prompt cloud formation. That possible effect looks to be 
smaller than thought, but the story won't end there. 


Striking correlations have been observed 
between Earth’s cloud cover and the flux of 
galactic cosmic rays entering our atmosphere. 
The decrease in galactic cosmic ray (GCR) 
flux by about 15% over much of the twentieth 
century has led to the hypothesis that GCRs 
could influence climate through their effect 
on cloudiness. This controversial possibility 
is revisited in a paper in Geophysical Research 
Letters by Pierce and Adams’. 

There are several plausible mechanisms that 
could link GCR flux and cloud properties”. A 
leading candidate is the ‘ion-aerosol clear-air 
mechanism, in which atmospheric ions cre- 
ated by GCRs act as nuclei for the formation of 
atmospheric particles. The nucleation of new 
nanometre-sized aerosol particles is observed 
frequently, and in many parts of the atmos- 
phere, and is thought to be a major source of 
cloud-condensation nuclei (CCN) — parti- 
cles large enough for cloud droplets to form 
around them. The link between GCRs and cli- 
mate is therefore plausible because any change 
in GCR-ionization rate might be expected to 
drive changes in cloud-droplet concentrations, 
and hence the amount of solar radiation that 
clouds reflect back to space. 

Atmospheric ions can indeed seed new par- 
ticles*, but two outstanding questions have 
hampered progress. What fraction of nuclei is 
created this way? And what fraction of these 
particles grows large enough to influence CCN? 
To be relevant to recent climate change, it would 
be necessary to show that the decrease in GCR 
flux during the twentieth century could lead to 
significant changes in CCN and clouds. 

In their paper’, Pierce and Adams estimate the 
magnitude of the ion-aerosol clear-air mecha- 
nism. They used a global atmospheric model 
with a detailed treatment of aerosol physics to 
estimate some limiting values of CCN formation 
from changes in GCR flux. Their conclusion is 
clear: CCN concentrations just aren't very sensi- 
tive to the changes in GCRs that have occurred 
during the twentieth century. The authors 
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predict that CCN concentrations will change 
by less than 0.1% between solar maxima and 
minima as GCRs change by 15% — about the 
same as the change seen during the last century. 
They estimate that this change in CCN trans- 
lates into a change of 0.005 watts per square 
metre in solar radiation reflected from clouds, 
insignificant compared with the greenhouse-gas 
warming of 2 watts per square metre or more 
over roughly the same period. 

Pierce and Adams's model is quite sophis- 
ticated in the way it treats the global lifecycle 
of aerosols, from formation at nanometre sizes 
to their eventual growth over days to weeks to 
CCN sizes. But rather than trying to model the 
complex ion-aerosol processes in detail (phys- 
ics that is still incompletely understood), they 
make an upper-limit assumption that all nucle- 
ation is due to ions, thereby circumventing one 
obstacle to making such a global assessment. 

Is this negative result the last word on the ion- 
aerosol clear-air mechanism? Climate modellers 
are always quick to point out that predictions 
can be model-dependent. Certainly CCN may 
be more sensitive to the ion-induced nucleation 
rate in a different model or under conditions not 
explored by Pierce and Adams. But other global- 
model studies** of nucleation suggest that CCN 
are fairly insensitive to the nucleation rate for a 
simple reason: during the time taken for nuclei 
to grow to CCN sizes, coagulation depletes 
particle concentrations — just as raindrops 
are always fewer in number than cloud drops. 
Unless there is some as-yet-undiscovered pro- 
cess that accelerates the growth of a few charged 
nucleiall the way up to CCN sizes, this low sen- 
sitivity is likely to be a robust conclusion. 

Despite this result', it is likely that a cosmic- 
ray—cloud-climate connection will continue 
to be explored, for two reasons. First, scien- 
tists continue to be intrigued by correlations 
between cosmic rays, Earth’s electrical state 
and climate variables (clouds, precipitation, 
drought and so on) on timescales from hours 
to millennia®”. Because the climate displays a 
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Light on stratocumulus: GCR-cloud 
physics will remain a hot topic. 


multitude of cycles on almost all timescales, 
detection of a correlation among climate vari- 
ables usually meets with initial and healthy 
scepticism. But variations in cloud proper- 
ties observed on timescales that are unique to 
GCRs* will always prompt a hunt for a plausible 
mechanism. 

The second reason that GCR-cloud physics 
will remain a hot topic is that we have yet to 
explore all the possible mechanisms. Atten- 
tion may now shift to the ‘ion-aerosol near- 
cloud’ mechanism”. GCR ionization modulates 
the fair-weather conduction current (about 
2 picoamps per square metre) flowing between 
the ionosphere and Earth, thereby altering the 


charge that has been observed to accumulate 
around cloud layers. Just like static electricity, 
this charge can influence how cloud drops stick 
to aerosol particles. If the particles are effec- 
tive nuclei for ice formation, then GCRs may 
influence cloud glaciation and precipitation. 
And the charge on some aerosol particles in 
the near-cloud environment could possibly 
become large enough to influence the forma- 
tion of cloud drops directly’. But our under- 
standing of the relevant physics is incomplete, 
and it will be some time before global-impact 
investigations along the lines of Pierce and 
Adams's study can be made. 

Some would argue that the link between 


cosmic rays and climate is just too tenuous to be 
worth pursuing. Others would point out that, by 
ignoring the fact that the atmosphere is actually 
adilute plasma (that is, is weakly ionized), we are 
missing some potentially important cloud phys- 
ics — and clouds are a very large lever by which 
to influence climate. Despite the controversy, it is 
clear that the study of cosmic rays in our climate 
system has come of age. Sophisticated models of 
ion-aerosol processes now exist. They are sup- 
ported by observations and laboratory stud- 
ies, which will include the upcoming CLOUD 
experiment at the CERN laboratory near 
Geneva, Switzerland, in which a proton beam 
will generate highly controllable ionization 
events in an aerosol-cloud chamber”. rT] 
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GALAXY FORMATION 


Anatomy of elliptical galaxies 


Luca Ciotti 


The family of elliptical galaxies is remarkable for the structural regularity 
of its members. Inspecting irregularities in this regularity could help in 
understanding how these galaxies form. 


One of the most-debated subjects in modern 
astrophysics is how elliptical galaxies, which 
are among the oldest known objects in the 
Universe, formed. Among the various likely 
formation mechanisms, merging is the most 
popular. According to this theory, different 
galaxies are the aftermath of merger events 
between progenitors of different morpholo- 
gies and of varying encounter geometries. But 
observations indicate that there is room for 
other mechanisms. Despite great endeavour in 
trying to match the regularities observed in the 
structures of elliptical galaxies with theoreti- 
cal models, there is still no consensus view of 
how they formed. Writing in The Astrophysical 
Journal Supplement Series, Kormendy and col- 
leagues’ report a meticulous study of all known 
elliptical galaxies in the Virgo cluster (one of 
the clusters of galaxies nearest to Earth) that 


investigates how departures from the observed 
regularities can be diagnostic of the processes 
that triggered the formation of these galaxies 
(Fig. 1, overleaf). 

The most striking property of elliptical gal- 
axies is that their brightness profiles — that is, 
the way in which the combined luminosity of 
their stars varies with distance from the cen- 
tre — depend in a regular way on their total 
luminosity (Sérsic’s law). Other properties of 
elliptical galaxies that correlate with their total 
brightness include size, mean star velocity and 
metal content. Another trait shared by these 
stellar systems is a supermassive black hole, 
with a mass of the order of one-thousandth of 
the galaxy’s stellar mass, at their centre’. 

In their study of the Virgo cluster of galax- 
ies, Kormendy et al.’ report galaxy luminos- 
ity profiles over large radial ranges and argue 
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that elliptical galaxies are not arranged as a 
continuous sequence of objects with proper- 
ties that scale well with their total luminosity. 
Instead, elliptical galaxies seem to branch out 
into two families according to a threshold value 
for the total luminosity. This dichotomy mani- 
fests itself in two kinds of departure from the 
Sérsic law at small radii. Luminous ellipticals 
have ‘cuspy’ cores — that is, their luminosity 
profiles are characterized by ‘missing light at 
small radii, because their brightness at such 
radii drops below the Sérsic-fitted, larger-radii 
profile. By contrast, less-luminous ellipticals 
are all ‘coreless’ — their central luminosity pro- 
files seem to have ‘extra light at small radii (but 
see Graham ef al.’ for a different interpretation 
of the central-light profiles). 

Kormendy and colleagues’ results add weight 
to other observations that have hinted at a 
dichotomy in the properties of elliptical galaxies. 
Luminous-core galaxies are known to be slowly 
rotating; to be relatively anisotropic (properties 
such as stellar velocities depend on direction); 
to have triaxial shapes (they have different diam- 
eters in all three directions); to have quite ‘steep 
Sérsic profiles; and to have stars that are mostly 
very old and that formed on comparatively short 
timescales. Conversely, low-luminosity coreless 
ellipticals rotate rapidly; are more isotropic; have 
mostly oblate-spheroidal shapes; have quite 
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‘shallow’ Sérsic profiles; and have stellar popula- 
tions that are younger and that formed on longer 
timescales. It is also well known that core ellipti- 
cals contain hot X-ray-emitting gas and tend to 
be radio-loud (they have strong radio emission), 
whereas the coreless galaxies do not*. 

The global regularities of the family of 
elliptical galaxies have already been used to 
set important constraints on their formation 
mechanisms**. Kormendy and colleagues’ 
analysis now demonstrates that studies exam- 
ining departures from those regularities could 
provide even more compelling clues. One pre- 
viously recognized possibility is that the cores 
of luminous, large ellipticals could be natu- 
rally scoured by binary black holes’, one likely 
product of the merging of two smaller elliptical 
galaxies, each of which harbours a central black 
hole. But binary black holes should also be 
found in coreless elliptical galaxies if merging 
is the main formation mechanism of the whole 
family of elliptical galaxies. So what thwarted 
core scouring by binary black holes in coreless 
ellipticals? 

Kormendy et al.' suggest that if enough cold 
gas (provided by the merging companion) is 
carried to the centre of the forming coreless 
galaxy and turned into stars that provide the 
extra light, it could swamp core scouring by 
binary black holes. The picture that emerges 
from the new study’, therefore, is one in which 
luminous-core ellipticals are the products of 
mergers between progenitors that had large 
numbers of old stars but little or no cold gas 
(‘dry’ mergers), whereas less-luminous coreless 
galaxies are the aftermath of mergers between 
(cold-)gas-rich galaxies (‘wet’ mergers). The 
wet mergers would trigger starburst activity; 
any energy released by the final black hole (the 
product of merging between the two progenitor 


334 


Figure 1| The 
deceptive looks 

of elliptical 

galaxies. Elliptically 
shaped galaxies, 
such as the larger 
system ESO 325- 
G004 (top left) seen 
in this Hubble Space 
Telescope image 

of the Abell cluster 
of galaxies S0740, 
appear to be quite 
simple objects. 
Kormendy and 
colleagues' show 
that these apparently 
uncomplicated 
galaxies disclose 
unexpected 
intricacies that could 
shed light on the 
mechanisms of their 
formation. 


black holes) because of gas accretion would not 
be sufficient to heat the gas and quench star 
formation. 

Although the hypothesis discussed by Kor- 
mendy et al. is appealing, it is not the final 
word on elliptical-galaxy formation. Major 
evolutionary mechanisms are thought to be 
at work within elliptical galaxies. For example, 
in the several billion years of their evolution, 
the dying stars of an elliptical galaxy release an 
amount of gas equal to about 20-30% of the 


initial stellar mass of the galaxy. However, the 
observed present-day black-hole-to-galaxy 
mass ratio empirically dictates that, at most, 
1% of this gas can be accreted by the central 
black hole. Therefore, essentially all of this gas 
is available to be recycled by the galaxy in some 
way other than black-hole accretion. Ifonly a 
minor fraction of this gas flowed to the centre 
of the galaxy and was transformed into stars, it 
could suffice to produce the central extra light 
in low-luminosity ellipticals*, whereas galaxy 
merging could be solely responsible for the cre- 
ation of cores in high-luminosity ellipticals. 

Increasingly refined computational 
simulations””’ are being devised to untangle the 
relative contributions of the different physical 
processes involved in galaxy formation. For now, 
Kormendy and colleagues’ results demonstrate 
that research on elliptical galaxies, which were 
once considered rather dull objects, is revealing 
surprising secrets about galaxy formation. ™ 
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ECOLOGY 


Towards a theory of biodiversity 


Jayanth R. Banavar and Amos Maritan 


Models of ecological communities that incorporate mutation and spatial 
dispersal can yield results that go some way to explaining observations. 
A further step is to add sexual reproduction to the mix. 


Humans are wreaking havoc on the planet’, 
and the result is a continuing mass extinction 
of species. Yet there is little understanding 
of the various ecological effects of this loss of 
biodiversity. An initial requirement is to work 
out how an ecological community is assembled 
in the first place, and on page 384 of this issue 
de Aguiar et al.’ offer an advance in tackling 
this formidable task by extending an explana- 
tory approach to biodiversity known as neutral 
theory. 

First, some context. Life isa non-equilibrium 
phenomenon involving a ‘dual-diffusiom pro- 
cess. Organisms diffuse and distribute them- 
selves in space, and compete for resources, 
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such as light, water and nutrients. At the same 
time, on different timescales, there is diffusion 
in ‘genome space’ that leads to new species and 
serves to maintain diversity. The evolution 
of an ecological community corresponds to 
the dynamics of the distribution of occupied 
regions in real space and genome space, and 
the development of interactions between the 
two. Owing to the constraints of the available 
resources, growth in the population of one spe- 
cies must necessarily lead to a decrease in the 
population of another and, in some cases, to 
extinction. Niche effects**, which arise from 
the competition for resources in a spatially 
heterogeneous environment, provide a simple 
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rationalization for species coexistence, rarity 
and extinction. They can, to some extent, be 
used to predict the observed distribution of 
species and their traits along nutrient gradients 
within the environment. 

The environment is itself shaped by the indi- 
viduals that make up the community, leading 
to a dynamic feedback mechanism. Thus, an 
ecological community is complex, with myriad 
interactions among and between species, as 
well as interactions of species with a spatially 
inhomogeneous and temporally changing 
environment. It isa many-body system exhib- 
iting emergent characteristics®. To compound 
matters, many of the interactions are unknown 
or, at best, imperfectly known. 

A fundamental understanding of 
biodiversity requires a theoretical framework 
based on a restricted set of verifiable hypoth- 
eses. An example of such an approach is neu- 
tral theory — presented in 2001 by Hubbell’ 
— which predicts patterns of organisms that 
agree well with observational data. Hubbell’s 
simplifying idea was to treat all species at the 
same trophic level (position in a food chain) 
as equivalent to each other. The rarity of one 
species and the high abundance of another 
do not result from their distinctive traits, but 
rather from stochastic drift in populations of 
essentially identical species. Species diversity 
comes about not from differences among spe- 
cies, as highlighted in niche theory, but rather 
from the interplay between extinction caused 
by random drift in species populations and the 
introduction of new species by either immigra- 
tion from a surrounding region or speciation. 

In the new paper, de Aguiar et al.’ present 
computer simulations of a specific neutral 
model of dual diffusion along with the process of 
sexual reproduction. Previous approaches took 
into account diffusion in real space and postu- 
lated the appearance of new species through 
mutation without explicit consideration of the 
evolution of the genetic codes of the breeding 
organisms. In the simulations, de Aguiar et al. 
observe the spontaneous emergence of species, 
a group of individuals with genetic codes occu- 
pying a sub-region of the whole genome space, 
living in a sub-region of the space available to 
the community. This dual segregation arises 
from the double constraints on the difference in 
the genetic codes as well as the spatial distance 
between prospective mates, and does not require 
the presence ofa geographical barrier. The aver- 
age distance between a randomly chosen pair 
of points in a community of fixed size is higher 
in lower spatial dimensionality; consequently, 
segregation is easier in lower dimensions and 
leads to greater species diversity. 

Remarkably, the results of the simulations, 
with many adjustable parameters, are in good 
accord with a variety of observed patterns. 
Examples of such patterns are the constant rate 
of speciation observed in the fossil record; the 
higher diversity of freshwater ray-finned fishes 
than of their marine counterparts; the species— 
area relationships of birds, flowering plants and 


tropical-forest trees; and the relative species 
abundance of birds and forest trees. 

A framework for understanding biodiversity 
requires identification of the essential ingredi- 
ents and an understanding of their effects. Along 
with other colleagues, we have used simplified 
models with no explicit interactions between 
species or with the environment to derive ana- 
lytical expressions for relative species abun- 
dance’, the spatial distribution of individuals’, 
and the dynamics of an ecological community”, 
and these fit the empirical data moderately well 
with relatively few adjustable parameters. A les- 
son from these calculations is that just because a 
theory fits the data, it does not necessarily imply 
that the assumptions underlying the theory are 
correct. In fact, one can sometimes fit data 
such as relative species abundance equally well 
through distinct hypotheses showing that addi- 
tional data, not always measured, are needed to 
discriminate between competing biological pos- 
sibilities. The proper definition of interactions, 
the means to infer them from data, and their 
incorporation into theory leading to testable 
predictions remain a huge challenge. 

Among other lines of investigation, what 
is needed next is work along two fronts: the 
development of analytically tractable models 


that include the innovations introduced by 
de Aguiar et al., and careful analysis of field 
data to assess whether the underlying assump- 
tions of the simulations are valid. The journey 
towards understanding biodiversity will be a 
long one, but it is encouraging that we are at 
least taking steps down that road. a 
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CLIMATE CHANGE 


Beyond the CO, connection 


Rainer Zahn 


At times in the past, mobile ocean fronts in the subtropics have exercised an 
influence on the magnitude of climate change by decoupling temperature 
from levels of carbon dioxide in the atmosphere. 


During the past 800,000 years — the late 
Pleistocene and subsequent Holocene — 
Earth's climate has swung between cool and 
warm and back again many times. Variations in 
incoming solar radiation, modulated by Earth's 
orbital parameters’, along with recurring ups 
and downs in atmospheric carbon dioxide con- 
centrations’, are among the prominent features 
associated with these climate swings. At first 
glance, then, it looks like climate is intimately 
coupled with solar radiation and CO,. But on 
closer inspection, differences become appar- 
ent between the amplitudes of climate changes, 
and CO, variations in particular, that raise the 
question of how tight that coupling is. 

As they describe on page 380 of this issue’, 
Bard and Rickaby have addressed this ques- 
tion. They analyse palaeoclimatic records from 
a marine sediment core in the southwestern 
Indian Ocean, and show that the ocean front 
separating the warm subtropical ocean from 
the cold subantarctic zone may influence 
global-scale temperature control. Bard and 
Rickaby use a suite of data records to monitor 
latitudinal migrations of the subtropical front 
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as climate cycled between ice ages and warm 
ages during the past 800,000 years. What makes 
their study particularly persuasive is that the 
sediment core from which they draw their data 
comes from the entrance to the ocean gateway 
that connects the Indian Ocean with the Atlan- 
tic Ocean at the southern tip of Africa. 

At this gateway, warm and salty waters leak 
from the Indian Ocean into the South Atlantic, 
forming part of the global ocean thermohaline 
circulation and compensating for the export of 
deep water from the Atlantic to the rest of the 
world ocean. The salt-water transport to the 
South Atlantic causes a south-to-north den- 
sity gradient in the Atlantic as a whole that has 
the power to influence the Atlantic’s meridi- 
onal overturning circulation and, ultimately, 
the Gulf Stream, with consequences for heat 
transport and climate across the North Atlantic 
region*°. In theory, the latitudinal position of 
the subtropical front defines how wide the gate 
is left open for the salt-water transport from 
the Indian Ocean to the Atlantic. 

Ocean fronts are boundaries between water 
masses of different temperature and salt content, 
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Figure 1| The Agulhas current and the subtropical front. The Agulhas is the largest ocean jet current 

in the Southern Hemisphere, and carries warm, salty waters from the South Indian Ocean subtropical 
gyre along the eastern edge of southern Africa. Most of the current feeds back into the subtropical 
gyre, but some of its waters flow through the ocean gateway off the southern tip of Africa and into the 
Atlantic. This ‘leakage’ alters the density structure of the South Atlantic, with northward propagation 
of subsurface pressure waves potentially affecting the meridional overturning circulation in the North 
Atlantic**, which has global effects. Migration of the mobile subtropical front, as described in the work 
of Bard and Rickaby’* , is diagnostic of shifted wind patterns that can alter the Agulhas Current and the 
southern African ocean gateway and so, potentially, the choreography of ocean-climate interactions. 


and so of different density. That they affect 
ocean circulation and climate is not new. In 
particular, it has long been recognized’ that, 
during the late Pleistocene, the North Atlantic 
polar front exerted control over the Atlantic 
overturning circulation and thus over climate 
in the Northern Hemisphere. Gains and losses 
in seawater density near the centres of deep- 
water formation were tightly linked with move- 
ments of the polar front. Shifting the front to 
a northerly position during the warm periods 
between ice ages allowed warm subtropical 
waters to penetrate far into cold subpolar lati- 
tudes, invigorating evaporation that increased 
the salt concentration and hence the density of 
sea water, thereby stimulating deep convection. 
A southerly position of the polar front during 
ice ages tended to reduce the thermal con- 
trast between surface water and the overlying 
atmosphere, reducing evaporation and salti- 
ness and causing a less-vigorous deep convec- 
tion. Although it seems simplistic, the validity 
of this concept was demonstrated in the early 
days of palaeoceanography’*, and since then the 
focus has been fixed on the high latitudes. 
Bard and Rickaby’ shift attention to the 
subtropics by postulating that the subtropical 


336 


front has likewise acted as an agent of change. 
Two ‘superglacial intervals serve as test cases 
— marine isotope stages 10 and 12 — during 
which Earth experienced unusually cold cli- 
mates despite levels of atmospheric CO, that 
were no different from those during the pre- 
ceding and subsequent ice ages. The suite of 
temperature-sensitive geochemical and faunal 
records that Bard and Rickaby present consti- 
tute a compelling case that, during both super- 
glacials, the subtropical front was shifted by as 
much as 7 degrees latitude to its northernmost 
position of the past 800,000 years. 

In this position, the subtropical front poten- 
tially would have prevented water transport to 
the Indian-Atlantic gateway by intercepting 
the Agulhas current — which runs south along 
the eastern edge of southern Africa — closing 
the gate on the inter-ocean transport of water 
(Fig. 1). In consequence, the Atlantic overturn- 
ing circulation could have been forced into an 
unprecedentedly slow mode. This then reduced 
the poleward oceanic transport of heat, so the 
thinking goes*®, to the extent that ice sheets in 
the Northern Hemisphere grew beyond their 
normal limits and climate cycled into a severe 
cool state. 
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So much for the concept. But what might 
have caused the subtropical front to undergo 
such an exceptional northward migration? The 
position of the front is a consequence of the 
processes that alter the temperature distribu- 
tion at the southern reaches of the large oceanic 
subtropical gyres. Today, a central element is 
the interplay between the easterly trade winds 
in the subtropics and the westerly winds that, 
in turn, interact with the ocean to move water 
(and heat) around its surface. The dynamics 
that would have caused the wind patterns to 
change, such that the subtropical front may 
have shifted that far north during marine iso- 
tope stages 10 and 12, are not yet clear. 

It also remains to be seen if a northward- 
migrating subtropical front would, by itself, 
have been strong enough to block a current 
as mighty as the Agulhas and so affect water 
transport to the Atlantic. A different way 
of altering the scale of the leakage is for the 
Agulhas current to shift gears. The strengths 
of currents typically vary in parallel with shift- 
ing wind patterns and could act to invigorate 
or weaken the Agulhas leakage to the Atlantic. 
The details of this mechanism have not yet 
been fully explored”””. But the strength of the 
Agulhas has been steadily increasing since the 
1980s, driven by changed gradients in atmos- 
pheric pressure and in wind fields". 

Bard and Rickaby’ make a compelling case 
that, at times in the past, severely reduced water 
transport between the Indian and Atlantic 
oceans may have caused climate to cool beyond 
typical ice-age conditions. But will the reverse 
also hold? Will an increased leakage (forced 
by recent shifts in atmospheric pressure and 
winds’) compensate for the loss of salt in the 
North Atlantic’”” (caused by increased precip- 
itation and changes in the freshwater flux from 
the Arctic)? Would this be enough to stabilize 
the Atlantic overturning circulation? Let’s keep 
an eye on what the leakage does next. a 
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PHOTONICS 


Light control at will 


Sajeev John 


Microchips that make use of light instead of electrons could outperform 
their electronic counterparts if light flow can be controlled at will. Photonic 
crystals are instrumental in achieving such a manoeuvre. 


Photonic crystals are artificial materials made 
of periodically arranged insulating structures 
that are specially designed to aid the trap- 
ping and confinement of light within a very 
small area. These crystals offer the possibility 
of confining light on sub-wavelength scales 
in air, and provide a foundation for guiding 
light through circuit paths in a three-dimen- 
sional (3D) optical (as opposed to electronic) 
microchip without energy loss caused by light 
scattering. The crystals also enable unprec- 
edented strong coupling effects between light 
and semiconductor nanocrystals, known as 
quantum dots. These effects provide a route 
to fast information processing and computing 
by optical means. However, such tantalizing 
opportunities require that photonic crystals 
be accurately synthesized and that light be 
efficiently manipulated both inside and at 
the surface of the crystal without loss. In this 
issue (page 367), Ishizaki and Noda’ take an 
important step towards achieving these goals 
by demonstrating the control of light at the 
surface of a suitably constructed, gallium- 
arsenide-based, 3D photonic crystal. 

The special, regularly repeating arrangement 
of insulating (dielectric) structures in phot- 
onic crystals gives them a special property: 
light waves with frequencies that fall within a 
range known as the photonic band gap (PBG) 
are prevented from propagating because of 
destructive wave-interference effects. This 
provides a clean slate on which to write optical 
circuit paths. Ishizaki and Noda’s woodpile- 
type structure consists of layers of dielectric 
rods stacked together (see Fig. 1a on page 367). 
Although it exhibits a bulk 3D PBG — that is, 
frequencies within the PBG are blocked from 
flowing inside the material in all possible direc- 
tions — there are frequencies (modes) within 
the otherwise forbidden gap in which light can 
leak from the crystal interior to the exterior 
surface, travel freely along the surface of the 
crystal and eventually escape into free space. 
In other words, the edges of the slate are not 
completely clean. 


Modification 

To control the leakage of light into unwanted 
surface modes, the authors’ reapply the PBG 
concept to the surface itself. They modify the 
surface layer of the material by constructing 
a two-dimensional (2D) periodic structure 
to create a 2D PBG for the surface modes. 
The spectral overlap between the bulk 3D 
PBG and the 2D surface PBG then provides 


the desired frequency window in which no 
electromagnetic waves can propagate, either 
within the woodpile architecture or along its 
surface. With this completely clean slate, the 
last remaining escape pathways for light are 
removed and light within the 3D microchip 
can be controlled at will. 

The experimental demonstration’ of a 
simultaneous 3D PBG and 2D surface PBG 
has several implications. Ishizaki and Noda 
have used this to demonstrate a strongly 
localized state on the surface of their woodpile 
structure that persists for around 9,000 opti- 
cal cycles before it releases its trapped light to 
free space. This light trap was engineered by 
introducing a defect — in the form of a local 
deviation from horizontal periodicity — on 
the surface of a thin 3D photonic crystal; that 
is, one with only two repeating units of vertical 
periodicity. If thicker 3D crystals were used, 
the lifetime would be expected to increase 
dramatically. 

However, this is only one small indicator 
of the implications of the authors’ successful 
integration of defects of different dimensional- 
ity within 3D PBG architectures. For example, 
circuits for light in an optical microchip are 
based on one-dimensional defects known as 
linear waveguides”. In optical communications 
and information processing, a crucial issue is 
the ‘insertion loss’ as light enters the in-chip 
waveguide from an external optical fibre. The 
efficiency of the coupling between the chip 
and the fibre is limited by light scattering into 
unwanted surface modes. Unlike their conven- 
tional equivalents, photonic-crystal fibres’ — 
those carved out of otherwise-perfect 2D PBG 
crystals — would enable the efficient transfer 
of optical information from fibre to chip, pro- 
vided that unwanted surface modes on the 3D 
chip are eliminated*. Ishizaki and Noda’ results 
provide the required clean slate. 

The 3D optical microchips of the future will 
require not only the bulk 3D PBG crystal to 
provide overall confinement of light within 
the chip, but also the integration of lower- 
dimensional defects and band-gap materials. 
This would completely control the flow of 
light and tailor its nonlinear interaction with 
the optical resonators embedded within the 
photonic crystal. One possible solution is to 
use PBG heterostructures, which involve sand- 
wiching a layer of photonic crystal with a 2D 
PBG between two identical 3D PBG materi- 
als’. In their study’, Ishizaki and Noda have, in 
effect, synthesized the open-faced version of 
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the sandwich, a step towards constructing and 
demonstrating light flow within the fully 3D 
circuitry of a 3D optical microchip’. 

Optical computing in a photonic micro- 
chip requires that tiny pulses of laser light pass 
through the waveguide circuitry and that these 
pulses, in turn, control and switch the flow of 
other optical pulse streams. In free space, these 
pulse streams would simply pass through each 
other, transferring no information from one 
to the other. By coupling these optical pulses 
to ‘artificial atoms (quantum dots) embedded 
near or within a photonic-crystal waveguide, 
it should be possible to control light with 
light®. When the quantum dots are excited 
or de-excited by one pulse stream, they can 
coherently amplify or attenuate pulses in the 
second stream of optical information, enabling 
information processing. 


Confinement 

In conventional materials, high-power laser 
beams are needed to make this nonlinear 
process occur. However, with less than a 
milliwatt of laser power flowing through a 
3D PBG waveguide, it is possible to deliver 
electric-field amplitudes in the range of 10,000 
-100,000 volts per centimetre as a result of 
the confinement of light modes on sub- 
wavelength scales by the 3D PBG material. 
This tight confinement and concentration 
of light prompts an otherwise unattainable 
response from quantum dots at modest levels 
of laser power. Whereas the 3D PBG is often 
associated with the inhibition of spontaneous 
emission of light from a quantum dot, suit- 
ably embedded planar and linear waveguides 
can be used to engineer spectral windows 
within the PBG in which the rate of sponta- 
neous emission relative to that in free space is 
increased by several thousand. 

This enhancement of the coupling of light 
to quantum dots, or more generally of the 
strength of light-matter interactions, will help 
novel quantum-electrodynamics phenomena 
— such as ‘vacuum Rabi splitting’ in single 
quantum dots’ and large ‘Mollow splitting’ ina 
collection of quantum dots® — to emerge. The 
synthesis of high-quality 3D PBG materials and 
the control of their surface optical modes as 
demonstrated by Ishizaki and Noda is progress 
towards these and other far-reaching goals in 
both basic and applied science. | 
Sajeev John is in the Department of Physics, 
University of Toronto, Toronto, Ontario M5S 1A7, 
Canada. 
e-mail: john@physics.utoronto.ca 
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Jean Dausset (1916-2009) 


‘Father’ of the human leukocyte antigen system. 


Jean Dausset, who died in Majorca, Spain, 

on 6 June 2009, aged 92, will always be 
remembered for the discovery of the 

human leukocyte antigen (HLA) tissue 
system, one of the most significant scientific 
breakthroughs of the twentieth century. 

The HLA system — a collection of genes that 
encode polymorphic molecules expressed 

on the surface of cells — determines the 
biological identity of an individual's cells 

and tissues. Because recognition of HLA 
molecules by the immune system allows it to 
distinguish self cells from foreign cells, this 
discovery laid the foundations for successful 
organ transplantation. Dausset not only 
pioneered the scientific and medical research 
into HLA, but also embraced and promoted 
the sociological and philosophical impacts of 
this finding. 

Dausset was born in Toulouse, France, 
into a provincial bourgeois family, and 
spent his youth in Biarritz before the family 
moved to Paris. He completed his medical 
education at the Faculty of Medicine at the 
University of Paris. During the Second World 
War, he joined the French army transfusion 
service and participated in the Tunisian and 
Normandy campaigns. It was in the Army 
Medical Corps, where he oversaw large 
numbers of blood transfusions, that his drive 
to understand the causes of frequent adverse 
reactions against blood cells was born. 

As part of the Marshall Plan, he attended 
Harvard University for a year, in 1948, 
where he worked as a clinical haematologist 
at the Peter Bent Brigham Hospital. On 

his return to Paris, he specialized in 
immunohaematology. 

In 1952, Dausset made the seminal 
observation that serum from patients who 
had received multiple blood transfusions 
caused marked agglutination of white blood 
cells (leukocytes). Dausset attributed this 
leukoagglutination to an antibody in the 
serum directed against the white blood 
cells. In 1958, he described the first human 
leukocyte antigen, which he named MAC 
(now known as HLA-A2), in a single-author 
publication. He went on to define the genetic 
region that encodes the complex system of 
cell-surface antigens, which he termed HU-1, 
later renamed human leukocyte antigen. 

In the 1960s, Dausset teamed up with the 
surgeon Felix Rapaport in Paris. Together 
they performed hundreds of skin-graft 
experiments on volunteers, correlating 
graft survival with the extent of HLA 
incompatibility and thereby demonstrating 
the role of HLAs in human transplantation. 
He considered these volunteers the ‘real 
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heroes’ of the discovery of the HLA system 
and always acknowledged their contribution 
on receipt of honours. 

Today, many years after his discovery, 
tissue typing (determining the combination 
of HLAs on cells) is an essential tool for 
selecting donors for organ and bone-marrow 
transplantation. Dausset’s pioneering work 
on the HLA system was recognized in 1980 
when he was awarded the Nobel Prize in 
Physiology or Medicine, together with 
Baruj Benacerraf and George Snell, for 
“their discoveries concerning genetically 
determined structures on the cell surface 
that regulate immunological reactions’. 

Although Dausset’s own contributions 
were not in the field of immunology, he 
always insisted that HLAs were important 
in the control of immune responses as 
established by Hugh McDevitt, a colleague 
he particularly admired. They both 
anticipated that the HLA system contributed 
to susceptibility or resistance to disease, 
including autoimmune disease. With 
extraordinary prescience, Dausset extended 
this principle to espouse the concept of a new 
form of medicine that used an individual's 
genetic profile to predict disease — he named 
this ‘predictive medicine’ and foresaw its 
potential in personalized disease prevention. 

Dausset recognized that the HLA 
system was a tool that could be used 
to analyse the genetics and migration 
patterns of populations. His interest in the 
anthropological implications of his discovery 
took him on a voyage to Easter Island, where 
he collected blood samples from this isolated 
population for genetic analysis. 
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He also realized that the HLA model 
could be extended to the entire human 
genetic system and, in 1984, together 
with Daniel Cohen, created the Human 
Polymorphism Study Centre (CEPH) — 

a resource that facilitates genetic studies 

of human populations. The genotyping of 
large numbers of reference families by the 
CEPH, and the sharing of this information 
with scientific peers, contributed to the 
determination of the first genetic linkage 
map of the human genome, and led the way 
to the discovery of genes that predispose to 
numerous diseases. 

In addition to his medical research, 
Dausset worked as a French government 
adviser from 1955 to 1958, and was 
instrumental in shaping French law 
integrating science education and medicine, 
leading to the creation of the CHU 
(University Hospitals). In 1969, he founded 
France Transplant, an organization that 
coordinated organ transplantation across 
France. With Jean Bernard, his colleague at 
the Hopital St Louis, Paris, he established 
the French Bone Marrow Donors Registry 
of unrelated donors for bone-marrow 
transplantation. When these organizations 
were taken over by national administration, 
he voiced concern that the humanistic nature 
of these endeavours would be diluted by 
bureaucracy. Dausset was eventually elected 
president of the Universal Movement for 
Scientific Responsibility (MURS), and 
actively campaigned against patenting the 
human genome sequence. 

Dausset never had a mentor. The discovery 
of the HLA system stemmed from his own 
vision and was initially driven solely by 
his ferocious determination. By contrast, 
for more than 50 years as a scientist, 
he embodied a spirit of international 
collaboration, gathering around him 
numerous students, colleagues and friends, 
whom he encouraged and supported. This 
extended worldwide community, which 
had a strong South American and Hispanic 
flavour, firmly established him as the ‘father 
of HLA. He enjoyed this parental status and 
designated the immunologist Rose Payne the 
‘mother’ of HLA. 

A passionate collector of modern plastic 
arts, for a time he owned an art gallery in 
Saint Germain des Prés, Paris. He poetically 
summarized his philosophy of life in the 
title of his autobiography Clin d‘Oeil a la Vie 
(A Wink at Life). His wife Rosita, daughter 
Iréne and son Henri were a constant 
support. Dausset died peacefully in Majorca, 
surrounded by his family and immersed in 
the Hispanic culture he was so fond of. His 
uniqueness will endure. 

Dominique J. Charron 

Dominique J. Charron is at the Laboratoire Jean 
Dausset, Hépital Saint-Louis, and the Institut 
Universitaire d'Hématologie, 75010 Paris, France. 
e-mail: dominique.charron@sls.aphp.fr 
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& Rebecca Warren* 


All modern humans use tools to overcome limitations of our anatomy and to make difficult tasks easier. However, if tool use 
is such an advantage, we may ask why it is not evolved to the same degree in other species. To answer this question, we need 
to bring a long-term perspective to the material record of other members of our own order, the Primates. 


everal animal species use tools and selectively manipulate 

objects. Primate tool use has received particular attention, 

in part because of the close evolutionary heritage that other 

primates share with technology-dependent humans. 
However, the effects of material culture on primate long-term adap- 
tiveness have yet to be systematically explored. Here we review the 
interface of primatology and archaeology, following a recent intensi- 
fication of research into the relevance of other primates for under- 
standing hominin technology and behaviour. 

Extensive and flexible tool use was once considered a defining human 
characteristic’. Detailed observation since the mid-twentieth century 
has, however, revealed a wide variety of habitual tool use among wild 
chimpanzees (Pan troglodytes)’ in Africa, adding to records of less com- 
plex tool use by other species. At the same time, stone artefacts made by 
hominins—the human lineage since the split with chimpanzees some 
5-7 Myr ago*—have been found dating back 2.6 Myr (ref. 5). Because 
this earliest Oldowan technology already shows much planning depth, 
spatial coordination and manual dexterity’ in its creators, it is probable 
that earlier, currently unrecognized, tool manufacture occurred. 
Appeals to phylogenetic proximity therefore posit the last common 
ancestor of chimpanzees and humans to have been a tool user”*, and 
for many archaeologists chimpanzees have become the dominant 
referent for modelling early hominin behaviour. However, recent 
recognition that wild South American bearded capuchin monkeys 
(Cebus libidinosus) also habitually use tools’, whereas wild bonobos 
(Pan paniscus, the chimpanzee’s closest relative) rarely do'°, forces us 
to rethink the accepted roles of continuity and convergence in primate 
tool use. We may ask, for example, how many extinct primate groups 
independently ‘invented’ tool use during the past several million years, 
and what circumstances permitted or prevented such discoveries. 
Questions also arise as to the influence of long-term tool use on non- 
human-primate anatomy, and the reasons why hominins alone have 
taken tool use to such an extreme. 

A long-standing separation of anthropocentric archaeology from 
primate ethology has obscured the holistic perspective required to 
address these questions. Here, our solution is to introduce a new 
interdisciplinary field—primate archaeology—that examines the 
past and present material record of all members of the order 
Primates. This field provides a comprehensive comparative and 
long-term evolutionary framework for understanding the biological, 
environmental and social contexts of primate behaviour, through 


analyses of tool making, tool use and the spatially patterned accu- 
mulation of refuse. Examination of the contexts for non-human- 
primate artefact and landscape use, alongside early hominin equiva- 
lents, provides a new understanding of the origins and evolution of 
human behaviour. Comparisons with the patterned use of material 
objects by living and extinct taxa outside the Primates will provide 
the first step towards a universal framework for unravelling the beha- 
vioural implications of both human and non-human components of 
the archaeological record. 


The intersection of primatology and archaeology 

Since the first systematic recording of wild-chimpanzee tool use, 
primatologists have stressed its direct relevance to early hominin 
studies'’. Beyond phylogeny, the main reasons for this applicability 
are the convenient physicality and durability of material culture, 
which permit measurement of behaviour and inference of intentions 
and abilities even if the user is absent or dead. From an archaeological 
perspective, definitions of ‘tool use’’? are therefore less important 
than recognition of the adaptive benefits and interpretive potential 
of manipulated durable objects. 

Free-living primate populations use a wide variety of plant materials 
for extractive foraging, social interaction and self-maintenance”'*"° 
(Fig. 1). Wild chimpanzees, orangutans (Pongo spp.) and capuchin 
monkeys use leaf, wood, twig, grass and bark tools, often modifying 
the items to better suit the targeted task'7'’. Wild western gorillas 
(Gorilla gorilla) have also recently been reported to use branch tools 
for postural support”. These observations of primate plant-tool use 
expand the conceptual framework for our interpretations of early 
hominin behaviour, as hominin wooden artefacts are only known from 
within the past 800,000 years”', and poor preservation of organic tools 
in the archaeological record means a significant amount of informa- 
tion about the origins of human technology is now lost. Use-wear 
traces and microscopic residues on stone artefacts may provide indirect 
evidence of plant-tool manufacture in the more remote past”. 

The enforced focus on inorganic archaeological materials has led to 
the development of highly informative methods of measuring and 
interpreting stone artefacts**, often centring on raw-material selection 
and transport, and on the manufacturing process and resultant forms. 
The key characteristic distinguishing stone from most organic materials 
is the potential for controlled formation of acute margins during frac- 
ture (that is, detachment ofa flake from a larger core), with the degree of 
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of Anthropology, Miami University, Oxford, Ohio 45056, USA. °Primate Research Institute, Kyoto University, Inuyama, Aichi 484-8506, Japan. ‘Department of Archaeology, 
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Figure 1| Chimpanzee plant use. a, Plants as tools: fishing for termites 
(Gombe Stream National Park, Tanzania). Chimpanzee extractive foraging 
such as this varies culturally between groups, independently of biological or 
environmental influences. b, Plants as construction: sleeping in a nest 
(Gombe Stream National Park). Chimpanzees make a ‘bed’ every night, and 


control reflecting motor skill and cognitive capacities®. The resulting 
sharp edges can be used for tasks such as butchering animals, processing 
plants or trimming wood. Oldowan assemblages dated to ~2.6— 
1.6 Myr ago typically consist of minimally worked cores and flakes with 
associated debris, although careful reconstructions have shown that on 
occasion more than 70 flakes were struck from a single cobble®. 
Oldowan sites also contain battered (not systematically flaked) stone 
tools (Fig. 2a), and although recognized early on as hammers and 
anvils”, only recently have these been analysed in detail and compared 
with chimpanzee pounding tools*. Recovery of cut-marked and 
damaged fossil bones at some early archaeological sites*® suggests that 


recent work has shown that reoccupation of the same localities may result in 
recognizable, patterned debris accumulation. ¢, Plants as activity sites: 
cracking oil-palm nuts using a tree-branch anvil and stone hammer 
(Liberia). Photo, A. C. Hannah. 


cutting edges were exploited primarily for greater access to meat or 
marrow. However, as a wide variety of plants were available to 
Oldowan hominins’’, the relative emphasis on vertebrate faunal 
exploitation over that of vegetation remains hypothetical. Early homi- 
nin invertebrate exploitation is little discussed, but is ripe for further 
enquiry owing to its prevalence in the primate world*”. 
Chimpanzees and capuchins use stone hammer-and-anvil combi- 
nations to crack hard-shelled nuts'**° (Fig. 2b, c), as well as anvils for 
opening hard-shelled fruits*'**. Island-dwelling long-tailed maca- 
ques (Macaca fascicularis) use stones to crack molluscs and crabs”, 
and bearded capuchins use stones to dig for tubers and to process 


Figure 2 | Primate stone-tool use. a, Three ~1.7—1.6-Myr-old Oldowan 
pounding tools from Olduvai Gorge, Tanzania. Provenance (left to right): 
FLK North level 1-2; FLK North level 5; FLK North sandy conglomerate. 
Scale bars, 1 cm. b, Chimpanzee cracking nuts with a stone hammer and 
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anvil (Bossou, Guinea). The full social complexity of this activity cannot be 
reconstructed from the archaeological record alone. c, Adult male capuchin 
cracking nuts using a stone hammer and wood anvil (Boa Vista, Brazil). Note 
erect body position and relatively large (1.44-kg) hammer. 
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plants such as cactus'***. These tools can become fractured, abraded 
or pitted as a result of repeated impact damage, leaving potentially 
diagnostic use-wear patterns (Fig. 3). Although non-human primates 
have not been seen systematically creating sharp stone flakes, fortui- 
tous breakage of an anvil margin while processing plant foods has 
been hypothesized as a first step towards deliberate stone knapping”. 
The origins of intentionally flaked stone technology are therefore an 
important meeting point between primatology and archaeology, and 
archaeologists regularly invoke chimpanzee plant-processing beha- 
viour in discussions of either pounding tools or the accidental 
creation of flakes from anvils**’°. Continued input from both fields 
is required to resolve incongruities between the hypotheses that 
vegetal processing led to the critical discovery of stone flaking, but 
that meat and marrow processing were the first uses of the discovery. 

The oldest archaeological sites have no hominin fossils in direct 
association with stone artefacts*®. Homo habilis, currently the earliest 
recognized member of genus Homo, lived ~2.4—1.4 Myr ago”’ and was 
hypothesized to be the creator of the initial Oldowan tools**. However, 
the time gap between the start of systematic stone flaking and the earliest 
Homo fossils suggests that earlier hominin taxa could be the innova- 
tors*’. The relative brain sizes and manipulative abilities of the hominin 
species potentially ancestral to early Homo (that is, Australopithecus 
africanus, Australopithecus afarensis and Australopithecus garhi) appear 
to be roughly equivalent to those of the extant great apes**”*’. This has 
prompted studies of non-human-primate capabilities. When they are 
motivated, captive orangutans”, bonobos* and capuchin monkeys* 
are capable of rudimentary stone reduction leading in some cases to 
sharp-edged, flaked cutting tools. The bonobo results are distinguish- 
able from Oldowan artefacts, although the main bonobo subject dis- 
played an unprompted inclination throughout the study to break rocks 
by throwing them onto a hard substrate, rather than using the typical 
Oldowan method of striking one hand-held stone against another*’. 
This suggests an alternative avenue through which australopithecines 
and earlier hominins may have obtained sharp stone edges, and raises 
the possibility that extinct panins (the chimpanzee lineage since the split 
with humans) may have used the same technique. 

Aspects of modern primate anatomy, such as a capacity for bipedal 
posture and the structure of the forelimb, wrist and hands”, provide 


Figure 3 | Wear patterns on chimpanzee pounding stone tools. Repeated 
use of favoured hammers and anvils builds up distinctive damage patterns, 
permitting archaeological identification of these tools. a, b, Flaking, 
crushing, pitting and corner removals. c, Cavity formation. Scale bar 
divisions, 1 cm. Photos, C. Boesch and J.M. (Tai National Park, Cote 
dIvoire). 
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valuable comparisons in assessing the capacities of hominins living 
before 2.6 Myr ago. Primate dentition is sufficient for many scraping, 
slicing, crushing and trimming tasks**“*, and this has been offered as 
an explanation for the apparent absence of stone flaking among wild 
non-human primates. Documenting the selective feedback from 
both technology and environment in shaping anatomical traits there- 
fore forms an important research focus of primate archaeology. That 
both 45-kg chimpanzees and 3-kg capuchins use stone hammers 
averaging around 1kg or more in mass*°*? demonstrates that we 
cannot posit an a-priori relationship between the size of a primate 
and its potential tools. Similarly, Cebus has a significantly higher 
encephalization quotient (the ratio of brain size to expected brain 
size as calculated from body mass or metabolism) than do most other 
primates, including Pan and the extinct hominins before Homo”. 
Ideally, comparative anatomy allows us to track the divergent paths 
taken by the various primate lineages since their split. For panins, 
however, we are hindered by an almost total lack of fossils*'. The 
hominin fossil record is better represented, but the likelihood of past 
adaptive radiations makes it unclear which of the known extinct 
species are direct ancestors of the human line. 

It is also unclear which of the extinct hominins were tool users. 
Australopithecus garhi has been suggested as being responsible for 
cut-marked bones dated to 2.5 Myr ago in Ethiopia’, and 
Australopithecus robustus has been posited as being a potential 
bone-tool user in South Africa**. Chimpanzees show cultural vari- 
ation in their tool use; for example, not all populations use stone tools 
to crack nuts", and wild bonobos and gorillas have not been observed 
to use extractive tools of any kind’*. Thus, we should not assume that 
groups of extinct hominins were homogenous in their forms of 
material culture. Moreover, the parsimonious explanation that wide- 
spread Homo and chimpanzee material culture indicates a tool-using 
common ancestor cannot reasonably be extended to the common 
ancestor of apes and Cebus or Macaca. The New and Old World 
monkeys split from the line leading to the apes around 35 and 
25 Myr ago, respectively’, and wild tool use in these lineages is rare. 
Convergence in tool use among primates is therefore plausible, 
probably owing to similar adaptive pressures, foraging requirements 
and physiological constraints. Significantly, acceptance of conver- 
gence in monkeys opens the door to the possibility of repeated gain 
and loss of tool use among extinct primates in multiple lineages over 
millions of years. From this perspective, it is unreasonable to expect 
that the only tool-using, non-hominin primates are extant species. 

For a comprehensive comparative study of the development of 
technology, we must establish the antiquity and form of tools used 
by non-human-primate ancestors (and potentially also tool-using 
species outside the primates), along with their ecological contexts. 
This aim has been advanced through excavation of chimpanzee nut- 
cracking sites in Tai National Park, Cote d’Ivoire’’*, where finds of 
accumulated cultural refuse dating back at least 4,300 years helped 
establish the legitimacy of primate-archaeological research. 
Recovered tools were interpreted as fragments of pounding stones 
used for processing nuts, supported by discriminatory analysis of 
adhering starch residues. Ongoing studies at Bossou and Diecke in 
Guinea focus on chimpanzee tool selection, spatial patterning and 
functional characteristics for a number of surface-context, present- 
day nut-cracking localities* (Fig. 4a). Similarly, research on wild 
bearded capuchins has examined such factors as the wear traces left 
by repeated use of anvils for pounding hard-shelled nuts”, tool trans- 
port and selectivity”® and the kinematics and energetics of nut crack- 
ing’. Capuchins consistently select hammer stones that are 
appropriate for the task by size and weight, displaying planning 
abilities and the rapid employment of visual, acoustic and haptic 
clues to correctly identify suitable tools**. Preferential accumulation 
of such materials at sites with appropriate anvils—the latter identifi- 
able from wear traces—produces a capuchin “activity area’ (Fig. 4b), 
the remains of which may last for millennia. 
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Figure 4 | Primate site creation. a, Selective transport of a stone hammer 
and anvil by an adolescent male chimpanzee (Bossou, Guinea). Over time, 
this behaviour accumulates artefacts in preferred tool-use sites. b, Stone 


Chimpanzee and capuchin spatial concentration of pounding tools 
such as anvils and hammer stones, and repeated, frequently seasonal, 
re-use of locations such as nut-tree sites, can form recognizable non- 
human archaeological assemblages’’. The viability of identifying 
pounding tools from pre-Oldowan times onwards is strengthened 
by modern actualistic studies—of nut cracking, meat processing, 
marrow extraction, digging-stick manufacture and _ stone-tool 
manufacture—that leave distinct use-damage patterns on the tools 
and act as proxies for identifying past pounding activities. Recording 
use-wear traces on experimental anvils and hammer stones, and on 
those used by free-ranging primates, also gives a clearer understanding 
of the dynamic behavioural and decision-making processes involved. 
Such cross-taxa comparative work is still in its infancy; however, 
ongoing research applying this methodology to Plio-Pleistocene 
anvils from Koobi Fora, Kenya, and Olduvai Gorge” suggests that 
the various pounding activities may be differentiated by their use 
traces. Field experiments exposing wild chimpanzees to a variety of 
raw materials for extractive foraging complement this work”, as does 
the study of percussion-marked fossil bones”. 

Pioneering studies applying archaeological recording methods to 
the spatial distribution of chimpanzee nests°’ suggested that chim- 
panzees accumulate organic artefacts through repeated occupation of 
a site. Current work confirms this hypothesis®’, providing insight into 
early hominin site formation processes. A recent report of wooden 
digging tools used by savanna-woodland chimpanzees also demon- 
strates the importance of an archaeological approach to primate 
plant-tool use’’. The presence of digging tools, holes, masticated 
underground storage organs, faeces and knuckle prints indicated that 
chimpanzees had been digging for underground storage organs, 
despite the absence of direct observations of this activity. Wild capu- 
chins also use sticks to probe’; moreover, they exhibit repeated use of 
caves to sleep in at night, to thermoregulate during the hottest part of 
the day, and to seek refuge in during heavy rain (EthoCebus project, 
unpublished observations). All primates leave remnants that can enter 
the archaeological record (for example faeces as coprolites), and beha- 
vioural patterns that emphasize repeated use of particular tree, cliff or 
cave sites**® may result in spatially patterned and therefore detectable 
accumulations in both open and forested environments. These find- 
ings have clear parallels with the archaeological interpretation of 
hominin activity areas; however, recognition of the highly uneven 
preservation of this ephemeral evidence in the archaeological record 
is challenging. 


The role of primate archaeology 


Acknowledging that only some primate taxa exhibit technology, 
despite their inherited biological similarities, primate archaeology 
has the following interconnected aims. The first goal is to use con- 
current archaeological and primatological methods for recording 
living non-human-primate tool use to learn about the evolutionary 
trajectories of primate behaviour from both anthropocentric and 
‘primatocentric’ perspectives. The second is to employ comparative 
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anvil pitted by capuchin nut-cracking activity (Boa Vista, Brazil). Two stone 
hammers were found on the anvil, and nut debris was removed from the 
anvil before recording. Image from ref. 30. 


studies of similar technologies across human and primate taxa (for 
example pounding, probing and digging tools) to investigate the 
origins of tool use among primates and its social, physiological, 
biomechanical and environmental contexts. The third is to examine 
the spatially patterned behavioural characteristics of primates that do 
not use tools, to determine whether these patterns are recognizable in 
the material record and how they might inform us about the likely 
activities of extinct primates (including hominins that did not use 
tools). The fourth aim is to identify how convergence has shaped 
primate tool use, and the implications that this has for the tech- 
nologies of extinct primate taxa. 

Inimplementing this agenda, the most complex results will probably 
derive from stone-tool-using species such as humans, chimpanzees, 
capuchins and macaques, as these generate a durable material record. 
However, until research methods in this field are much more rigorous 
and well established, it would be premature to make decisions on the 
potential contribution of any one taxon. Much of our knowledge is 
very recent: a mere decade ago the list of known wild-primate stone- 
tool users included only humans and chimpanzees, and half a century 
ago solely humans. 

Despite a number of research projects that examine pre-sapiens 
contexts, most archaeological investigations worldwide remain 
focused on the activities of Homo sapiens, largely within the past 
few tens of millennia. This agrees well with definitions of archaeology 
as an anthropocentric science. However, we contend that archae- 
ology should be viewed as a set of methods for the recovery and 
interpretation of behavioural evidence, irrespective of the species 
creating the record. This view is already common in the study of 
species that may be our direct ancestors (for example H. habilis), as 
well as those that are not (for example Homo neanderthalensis and 
Homo floresiensis). The difference is that the comparative study of 
dynamic living-primate behaviour offers greater potential for hypo- 
thesis testing and data generation than any study of static objects left 
by extinct hominins. Furthermore, although terms such as ‘chim- 
panzee archaeology’® and ‘cultural panthropology’® have been used 
previously, they exclude non-ape primates; an inclusive term for 
these endeavours promotes common methods and encourages 
cross-taxa comparisons. 

Non-human primates display stone and plant-material selection, 
processing and accumulation behaviours that challenge the conven- 
tional view of hominins as the sole creators of archaeological sites. If 
geographical and palaeoecological data suggest that a site was habitable 
by both hominins and other (potentially extinct) tool-using primates, 
then ambiguity arises over the extent to which either taxon contributed 
to any recovered assemblage. The only dated chimpanzee archaeolo- 
gical site faces this problem, human and chimpanzee stone tools being 
present in the same deposit™. The known innovative abilities of tool- 
using primates should therefore act as a cautionary brake in automa- 
tically assigning assemblages from such sites to any one taxon*’. 

Whereas there is only one extant species of hominin and two of 
panin, there are eight living capuchin species®’, only one of which 
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(C. libidinosus) is known to use tools frequently in the wild’. Primate 
archaeological research into the contexts of Cebus long-term tool 
selection, tool manipulation and spatial and social organizational 
strategies therefore offers one of the clearest opportunities to uncover 
important variables in the emergence of material culture. For 
example, we need to quantify the energetic costs of being a habitual 
tool user, and assess how these costs affect differential survival. As 
mentioned above, acceptance of the convergent evolution of tool use 
among Old and New World monkeys means that we should 
reasonably expect other species to have discovered, employed and 
then lost tool use throughout the past. On the basis of extant primate 
behaviour, we can expect some of these instances to have involved 
stone tools, and for those artefacts still to be recoverable today. 
Primate archaeology enables us to discover whether or not the long 
record of stone-tool use in the hominin line (at present covering 
around half the period since the common ancestor with Pan) is 
exceptional. 

Sharp-edged tools are not the only stone artefacts produced during 
the Oldowan and later periods, and more material was used for 
percussion than for flaking at some early sites*’. A perspective that 
emphasizes hominin systematic flaking obscures the potential con- 
tribution of non-hominin primates to the known archaeological 
record, and inhibits meaningful discussion of the factors behind 
cross-taxa parallels evident in other areas of technology. These factors 
include energy trade-offs between diet and tool manufacture, inter- 
and intraspecies variability in resource exploitation and tool func- 
tion, intentionality in tool material and attribute selection, and 
modes of social transmission. From a longer-term perspective, 
primate archaeology also contributes essential data to debates over 
the necessity and sufficiency of tool use in the evolution of bipedalism 
and terrestriality. 

Comparisons between taxa that have close morphological and 
genetic affinities with humans form a naturally strong research basis. 
However, the rationale behind primate archaeology applies also to 
non-primates, because any animal that accumulates or modifies 
durable materials will leave a signature in the archaeological record. 
Animals as diverse as crows’, dolphins”, beavers”’ and bowerbirds’! 
selectively manipulate material objects in regular patterns for extrac- 
tive foraging, niche construction and sexual display. Animals that use 
stones to break open resources, for example the portable anvils of 
Californian sea otters’ or fixed nut-cracking anvils of New 
Caledonian crows”’, may leave recoverable impact signatures that 
inform on the time depth and adaptive value of tool use in these 
species. Animal behavioural studies typically concentrate on current 
abilities rather than evidence for the evolutionary development of 
object manipulation, and even those field sites with multidecadal 
research data (for example Gombe Stream National Park for chim- 
panzees) have recorded only a tiny fraction of the lifespan ofa species. 
A clearer understanding of the behavioural remnants of non-human 
animals provides a ‘control experiment’ on the uniqueness of human 
artefacts. 


The future of primate archaeology 


Primate archaeology places the entirety of human behavioural evolu- 
tion into its wider comparative biological context, and its establish- 
ment clarifies the need for refinement of existing research programs. 
First, there is a need for systematic collaboration: few archaeologists 
have seen a wild primate use a tool, and few primatologists have taken 
part in archaeological excavations. Integrated involvement of disci- 
plines outside these areas, including comparative anatomy, cognitive 
science and evolutionary ecology, will also be essential. Second, 
standardization of recording procedures will combine an archaeolo- 
gical focus on material attributes and actions with the comprehensive 
spatially, temporally and socially patterned behavioural data 
generated by primatology. Third, creation of a comprehensive data- 
base comparing cross-taxa tool making and tool use in their environ- 
mental, biological and social contexts will enable exploration of the 
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selective costs and benefits involved, with the ultimate aim of iden- 
tifying those circumstances driving the uptake and continuance of 
tool use among primates, including human ancestors. Finally, direct 
documentation of the use of both human- and non-human-primate 
tools will allow us to tie specific actions to resultant residues and wear 
patterns, in turn helping us to understand why archaeologically 
recovered artefacts were created or chosen, producing valuable data 
on cognitive evolution. 

Greater recognition of the value of primate tool use and site 
creation to human evolutionary studies can only help in the protec- 
tion of rapidly declining primate populations. Field observation of the 
frequency of site visitation and the processes of site accumulation 
produces clues to the timescales represented at known hominin sites. 
Linking data to identified individuals within a primate group provides 
rich contextual information on cultural transmission and demo- 
graphic variation in tool use within social and kinship networks”, 
as well as emphasizing the roles of age, sex and experience in pheno- 
mena such as tool standardisation, connecting artefacts to social roles 
and ontogeny. All such data help clarify those characteristics of tech- 
nology that differentiate primates from other tool-using species. 
Crucially, excavation of primate activity areas adds time depth to these 
observations; other than for the later hominins, the span of time over 
which any species has used tools is currently unknown. 

Recent years have seen the first observations of wild tool use by 
populations of orangutans, capuchins and macaques, and a dramatic 
increase in our knowledge of the range, variability and time depth of 
chimpanzee material culture. We have also seen the announcement 
of the world’s oldest hominin tools and the revelation of unexpected 
skill in their manufacture. With the likelihood of even earlier homi- 
nin tool use, the proposed addition of new taxa to the hominin family 
tree’”’>’° has added new complexity to our evolutionary history. 
Primate archaeology provides the necessary methodology and scope 
to incorporate these developments into a coherent framework for the 
systematic location and interpretation of evolutionarily significant 
material culture, opening up new avenues in our understanding of 
the place of technology in primate and human societies. 
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The Schistosoma japonicum genome 
reveals features of host-parasite interplay 


The Schistosoma japonicum Genome Sequencing and Functional Analysis Consortium* 


Schistosoma japonicum is a parasitic flatworm that causes human schistosomiasis, which is a significant cause of morbidity in 
China and the Philippines. Here we present a draft genomic sequence for the worm. The genome provides a global insight into 
the molecular architecture and host interaction of this complex metazoan pathogen, revealing that it can exploit host 
nutrients, neuroendocrine hormones and signalling pathways for growth, development and maturation. Having a complex 
nervous system and a well-developed sensory system, S. japonicum can accept stimulation of the corresponding ligands as a 
physiological response to different environments, such as fresh water or the tissues of its intermediate and mammalian 
hosts. Numerous proteases, including cercarial elastase, are implicated in mammalian skin penetration and haemoglobin 
degradation. The genomic information will serve as a valuable platform to facilitate development of new interventions for 


schistosomiasis control. 


Schistosomiasis is an ancient scourge of mankind, depicted graphically 
in papyri from Pharaonic Egypt and known from human remains over 
2,000 years old from China'’. Blood-dwelling trematodes (phylum 
Platyhelminthes) of the genus Schistosoma cause this chronic and 
debilitating disease, which afflicts more than 200 million people in 
76 tropical and subtropical countries. Morbidity is high and schistoso- 
miasis contributes to several hundreds of thousands of deaths 
annually*>. Three principal species can infect humans: Schistosoma 
japonicum, Schistosoma mansoni and Schistosoma haematobium. The 
first of these is prevalent in the Philippines and parts of Indonesia, and 
is a major disease risk for 66 million people living in southern China’. 
It remains a major public health concern in China despite over 50 years 
of concerted campaigns for its control**. Approximately one million 
people in China, and more than 1.7 million bovines and other 
mammals, are currently infected’. Control measures include 
community-based praziquantel chemotherapy, health education, 
improved sanitation, environmental modification and snail control. 
However, additional approaches, such as the development and deploy- 
ment of new drugs and anti-schistosome vaccines are urgently 
needed to meet the prevailing challenges, which include the spectre 
of praziquantel-resistant parasites”®. 

During their complex developmental cycle, schistosomes alternate 
between a mammalian host and a snail host through the medium of 
fresh water. After burrowing out of the snail host, free-swimming 
cercariae penetrate the skin of the mammalian host, travel through 
the blood to the liver via the lungs, and transform into schistosomula. 
These mature in the hepatic portal vein, mate and, in the case of 
S. japonicum, migrate to their final destination in the mesenteric 
venous plexus. Female worms release thousands of eggs daily, which 
are discharged in the faeces after a damaging passage through the 
intestinal wall. If they reach fresh water, eggs hatch to release free- 
swimming ciliated miracidia, which, guided by light and chemical 
stimuli, seek amphibious snails of the genus Oncomelania. Within 
the hemocoel of the snail, miracidia give rise asexually to numbers of 
sporocysts, in which further asexual propagation produces numerous 
cercariae. 

Eggs deposited by adult female schistosomes embolize in the liver, 
intestines and other tissue sites and are the key contributors to the 


pathology and associated morbidity of schistosomiasis. Notably, the 
highly adapted relationship between schistosomes and their snail 
intermediate and mammalian definitive hosts appears to involve 
exploitation by the parasite of host endocrine and immune signals”"®. 
The evasion strategies that underpin avoidance of the host immune 
system, allowing schistosomes to survive for years despite strong host 
immune responses, have long interested investigators intent on 
development of an efficacious vaccine. 

Unlike most other platyhelminths, schistosomes are dioecious. 
The genome is arrayed on eight pairs of chromosomes, seven pairs 
of autosomes and one pair of sex chromosomes. Females are the 
heterogametic sex (ZW); males are homogametic (ZZ)''"'”. No other 
lophotrochozoan”’ has yet been sequenced. 


Genome features and evolution 

General information. The whole-genome shotgun (WGS) sequencing 
strategy was used to decode the 397-megabase-pair (Mb) sequences, 
covering most (>90%) of the S. japonicum genome (Supplementary 
Tables 1 and 2 and Supplementary Fig. 1). A total of 13,469 protein- 
coding genes were identified, comprising about 4% of the draft 
S. japonicum genome (Supplementary Figs 2 and 3). Of the protein- 
coding genes, 6,972 (52%) were mapped to categories established by the 
Gene Ontology project (Fig. la and Supplementary Fig. 4) and an 
orthologue relationship existed between 2,516 (19%) of them and 
1,546 Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology 
groups (Supplementary Fig. 5). Schistosoma japonicum has a relatively 
large genome and low gene density in comparison with other inverte- 
brates, including Brugia malayi (Table 1). On the basis of the outbred 
source of the genomic libraries, high-quality discrepancies found 
during assembly were used to identify 557,739 single nucleotide poly- 
morphisms (SNPs) (Supplementary Table 3), with an average density of 
1.4 SNPs per kilobase pair, and the insertion and deletion (indel) rates 
were much lower. 

Repeat sequences. A total of 657 different repeat families/elements, 
constituting 159Mb (40.1%) of the S. japonicum genome were 
revealed by comparing known repetitive sequences and using the 
software REPEATSCOUT (version 1.0.3)'* (Fig. 2 and Supplemen- 
tary Table 4). Among them, 29 kinds of retrotransposon were found, 


*Lists of participants and their affiliations appear at the end of the paper. 


345 


©2009 Macmillan Publishers Limited. All rights reserved 


ARTICLES 


a Biological 
regulation 
(16%) 


Metabolism (34%) Locomotion (1%) 


Immune system (1%) 


Cellular 


Localization homeostasis (1%) 


9 
(14%) Biological 


\ ~ adhesion (2%) 


Cellular component Development \. 


organization (11%) *% 
and piegenests \ - Reproduction (3%) 
() \ 
\\_ Response to 
stimulus (5%) 
b Domains in one 


specific taxon 
(unique domains) 


Domains shared 
by two taxa 


Domains shared 
by three taxa 


| 


| | | | j | 
0 1,000 2,000 3,000 4,000 5,000 6,000 
Number of domains 


Domains shared 
by four taxa 


Domains shared 
by five taxa 


Figure 1 | Functional categorization of S. japonicum genes and protein- 
domain analysis. a, Proportion of the 6,972 S. japonicum proteins with 
functional information in different Gene Ontology categories. b, In 

S. japonicum, vertebrates (H. sapiens, G. gallus and D. rerio), insects 

(D. melanogaster and A. gambiae), C. elegans and Nematostella vectensis, a 
total of 7,562 domains were detected. The majority of S. japonicum domains 
are shared with other taxa, having the fewest unique domains, whereas 
vertebrates evolved significant numbers of unique protein domains. 


including known Gulliver, SjR1, SjR2 and Sj-pido elements as well as 
25 novel elements, together constituting 19.8% of the genome 
(Supplementary Table 5). Of the 25 novel retrotransposons, 18 are 
long terminal repeat (LTR) forms, four are non-LTR forms and three 
are Penelope-like elements—enigmatic retroelements that retain 
introns'’. Each type of retrotransposon was represented by one to 
793 intact copies or hundreds to thousands of partial copies. The 
non-LTR retrotransposons have significantly higher copy numbers, 
constituting 12.6% of the genome. 
Gene loss/duplication. It was intriguing to observe that schisto- 
somes share more orthologues with the vertebrates (Supplementary 
Table 6), such as H. sapiens (4,324 pairs), than they do with the 
ecdysozoans, for example C. elegans (3,292), despite the Ecdysozoa 
and Lophotrochozoa being phylogenetically adjacent’’. Similarly, the 
cnidarians and vertebrates have been shown to share more ortholo- 
gous genes with each other than either does with the ecdysozoans”®. 
One possible reason for this is that a higher evolutionary rate in the 
Ecdysozoa causes an apparently larger orthologue divergence, 
although the scenario of functional selection of orthologue patterns 
in the context of parasite—host interplay is also worth consideration. 
To test possible consequences of parasitism at the genome level, we 
investigated gene family and domain variations between schisto- 
somes and other metazoans. It is clear that there was minor variation 
in total numbers of protein families among S. japonicum (6,322) and 
the other species, such as C. elegans (6,669), D. melanogaster (5,184) 
and H. sapiens (6,877) (Supplementary Table 7). However, a major 
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reduction in number, or even the elimination, of protein domains 
was apparent in the S. japonicum genome, in that the great majority 
(3,654) of the 3,728 protein domains from the flatworm were shared 
with other species (Fig. 1b) and can thus be considered ubiquitous 
among metazoans, whereas 3,834 domains found in at least one of the 
other species were not detected in schistosomes. Of these 3,834 
domains, 1,140 were shared by more than three taxa of vertebrates, 
insects, a nematode and sea anemones (Supplementary Fig. 6). 
Notably, domain-loss events seem to be more widespread in 
S. japonicum than in any other species studied so far, including 
C. elegans, a model organism well known for rapid evolutionary rates 
and a high degree of gene loss'’. Roughly 1,000 protein domains have 
been abandoned by S. japonicum, including some involved in basic 
metabolic pathways and defence, implying that loss of these domains 
could be, at least partly, a consequence of the adoption of a parasitic 
way of life. 

Against the background of extensive gene/domain loss, the finding 
of expanded gene families in schistosomes might provide clues to the 
requirements for a parasitic lifestyle. Among the most expanded gene 
families in schistosomes (Supplementary Tables 8 and 9), that encod- 
ing leishmanolysin (a major surface protease, also called gp63), a 
member of the metallopeptidase M8 family, has 12 putative family 
members in S. japonicum, but there is only one in human, fruit fly and 
nematode (C. elegans), and only three putative counterparts in the 
free-living flatworm Schmidtea'* (Supplementary Information). In 
addition to elastase (see later), leishmanolysin-like proteases may 
contribute to tissue invasion by schistosome cercariae’’. 


Development and metabolism 

Cellular signalling pathways in development. To investigate 
regulatory networks involved in embryonic development and orga- 
nogenesis, we undertook comparative genomics analysis of well- 
characterized signalling pathways, including those for Wnt, notch, 
hedgehog and transforming growth factor B (TGF-f). Notably, the 
S. japonicum genome encodes these growth factors, receptors and 
essential components to regulate many cellular processes during 
organogenesis and tissue development (Fig. 3 and Supplementary 
Tables 10 and 11). Schistosoma japonicum also encodes endogenous 
epidermal growth factor (EGF)-like and fibroblast growth factor 
(FGF)-like peptides (Fig. 3). The intact downstream cascade 
composed of the Ras—Raf—mitogen-activated protein kinase 
(MAPK) and TGF-B—SMAD signalling pathways, including FGF- 
and EGF-receptors, has components sharing high identity with 
mammalian orthologues, which implies that schistosomes, in addi- 
tion to using their own pathways, can exploit host growth factors as 
developmental signals. Indeed, we have identified an insulin receptor 
with high sequence similarity with those of mammals”’, whereas no 
insulin growth factor or insulin molecules were found, further sup- 
porting the notion that schistosomes exploit key signalling pathways 
of their hosts for growth and metabolism. 

Metabolic pathways. Analysis of the KEGG pathways assigned to 
metabolic process (Supplementary Table 12 and Supplementary 
Figs 7 and 8) indicates that S. japonicum can use carbohydrates as 
energy/carbon sources. It is unable to de novo synthesize fatty acids, 
sterols, purines, nine human essential amino acids, arginine or tyro- 
sine (Supplementary Figs 9-11). Loss or degeneracy of fatty acid, 
sterol and purine synthesis pathways in schistosomes is probably a 
consequence of the adoption of a parasitic lifestyle; notably, the genes 
encoding all the key enzymes for both the de novo fatty acid and 
purine syntheses are complete in the free-living flatworm, Schmidtea 
mediterranea'® (Supplementary Information). To obtain essential 
lipid nutrients, the S. japonicum genome indeed encodes many trans- 
porters, including apolipoproteins, low-density lipoprotein receptor, 
scavenger receptor, fatty-acid-binding protein, ATP-binding-cassette 
transporters and cholesterol esterase (Supplementary Table 13), to 
exploit fatty acids and cholesterol from host blood and plasma. 
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Table 1| Summary of S. japonicum genomic features in comparison with oth 


ARTICLES 


er organisms 


Genome features Lophotrochozoa Ecdysozoa Deuterostomia 
Schistosoma Caenorhabditis Brugia Drosophila Anopheles Gallus Danio Homo 
japonicum elegans malayi melanogaster gambiae gallus rerio sapiens 
Total genome size (Mb) 398 100 88t 169 265 1,051 1,527 3,255 
Total GC content (%) 34.1 35.4 30.5" 40.2 42.2 Ai? 34.4 36.2 
GC content in coding regions (%) 36 42.7 39.67 53.3 443 42.9 34.4 a7 5 
GC content in intron regions (%) 33.8 32.1 27.6" 39.5 43.9 41.3 34.1 36.6 
GC content in intergenic regions (%) 34.7 35.4 30.97 40.2 42.2 41.2 34.4 36.2 
Repeat rate (%) 40.1 18.3 ~15' 24.7 16.6 10.8 46.6 44 
Total coding size (Mb)/ratio (%) 15.9/4 24.8/25 12.9/15 21/7/13 14.8/6 25.7/2 47.9/3 35.9/1 
umber of coding genes 13,469 20,077 11,515 14,144 12,527 18,107 35,321 25,077 
Gene density (genes per Mb) 34 200 130 84 47 iy 23 8 
Average CDS size (kb) 1.18 1,23 Li? 1.54 18 42 1.36 1.43 
Average gene size + s.d. (kb) 10.5 + 16.0 2.8 +3.2 2842.9 43+ 10.6 46+ 10.0 2162468 19.5+35.1 413+498.0 
umber of transfer RNAs 153 608 (~590*) ~233" 314 450 189 2,010 129 
umber of ribosomal RNAs 184 19 (~275*) ~400* 161 N/A 14 N/A 675 
| coding sequence (CDS)-related features were calculated from the KEGG database, version 46 (ftp://ftp.genome.jp/pub/kegg/release/archive/kegg/46/). Caenorhabditis elegans and B. malayi 
enome feature files were downloaded from Wormbase (ftp://ftp.wormbase.org/pub/). Other genome-feature files and genome sequences were downloaded from Ensembl (ftp:// 
p.ensembl.org/pub/release-49). 


The features calculated from the feature file are different from those in the reference°. 
Data obtained from previous publications*’. 


+ * spo D 


Nervous system and neuroendocrine system 

Platyhelminths possess a central nervous system with a variety of 
sensory structures that can transduce a wide range of stimuli, and 
use a neuroendocrine system to regulate growth, metabolism and 
homeostasis. 

Neurotransmitters and receptors. We characterized a number of 
receptors and transporters of neurotransmitters (Supplementary 
Table 14) that may be required, for example, by miracidia and 
cercariae to navigate through water to locate new hosts and for the 
schistosomula and adult flukes to establish and reproduce within the 
human vasculature. In addition to known neurotransmitters and 
receptors, we have identified a receptor for octopamine (Sup- 
plementary Table 14) and two key enzymes for synthesis of octopa- 
mine (Supplementary Fig. 12). 

The nervous systems of flatworms can be considered to be predo- 
minantly peptidergic’. We found additional putative neuropeptide 
receptors for opioids, galanin and melatonin. Thus, it appears that 
schistosomes can accept stimulation of the corresponding ligands as 
a physiological response to different environments, such as fresh 
water or the tissues of their snail and mammalian hosts. There are 
genes encoding receptors predicted to accept gastrointestinal neuro- 
peptide hormone signals including cholecystokinin, secretin, gastric 
inhibitory polypeptide and xenin, all of which are involved in func- 
tions promoting the release of alimentary tract fluids containing 
digestive enzymes. 


We also identified receptors for urotensin, angiotensin II and neu- 
romedin (types U and B), which have an important role in physio- 
logical regulation of the cardiovascular system, the hypothalamus 
and other vertebrate organs. Although schistosomes do not have 
these organs, these components could have other effects on the cells 
or tissues of the blood fluke, such as the regulation of cell growth or in 
tissue remodelling. In addition, we found receptors for hypocretin 
(orexin), leptin and hypothalamic neuropeptides. Together, these 
features suggest that schistosomes have many advanced physiological 
features regarded as more characteristic of higher metazoans. 

Unexpectedly, a myokinin-like receptor was also observed 
(Supplementary Table 14). Myokinins are invertebrate neuropep- 
tides with myotropic and diuretic activities for which a receptor, 
called lymnokinin receptor, was first identified in the tick 
Boophilus microplus*. The discovery of such a receptor in schisto- 
somes supports the notion that they might synthesize myokinin 
because their vertebrate hosts do not produce this neuropeptide. 
Additional examples of receptors found for other invertebrate neu- 
ropeptides included FMRFamide and myosuppressin***, both 
belonging to the FMRFamide-like peptide superfamily. 

Complex sensory system. Schistosomes have a variety of sensory 
structures using which they, during their different life stages, pre- 
sumably respond to a myriad of environmental stimuli. Free-living 
cercariae and miracidia can sense light, mechanical stimuli and tem- 
perature”, facilitating the finding of hosts, whereas the parasitic adult 
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Figure 2 | The distribution of categories and composition of repeat elements in the S. japonicum genome. Retrons, retrotransposons; SINE, short 
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Figure 3 | Putative signalling pathways for growth, development and 
neuroactive ligand-receptor interaction in S. japonicum. The pathways for 
growth and development (indicated with different colours), and the 
neuroactive ligand-receptor interactions in S. japonicum are shown on the 
left and right, respectively. TACE, tumour-necrosis-factor-o-converting 
enzyme; ProC, porcupine homologue (Drosophila); NICD, notch 
intracellular domain; FRP, frizzled-related protein 1; GSK3B, glycogen 
synthase kinase 38; TCF, transcription factor 7; ‘p’ within cycle, 


worms are able to respond to changes in levels of chemicals and 
nutrients. Using a top-down Gene-Ontology-based strategy to 
facilitate the gene annotation (Supplementary Fig. 13), we identified 
71 genes encoding receptors, membrane channels, enzymes and other 
components, such as rhodopsins/opsins”, phosrestins/arrestins”’, 
transducins, cyclic nucleotide-gated channel, rhodopsin kinase and 
guanylate cyclase 2D (Supplementary Table 15). Both S. japonicum 
and S. mansoni have only two members of the rhodopsin family, 
unlike Drosophila, which possesses 13 members, and zebrafish, which 
has at least seven (Supplementary Fig. 14a). Phylogenetic analysis 
indicated that there are at least four schistosome transducins, each 
of which could represent a divergent subtype of transducin super- 
family across chordates, echinoderms, molluscs and arthropods 
(Supplementary Fig. 14b), and could therefore mediate distinct res- 
ponses of sensors to signals. 

The genome sequence analysis also revealed an array of genes encod- 
ing sensory proteins that could interact with chemical ligands and 
other stimuli. These included guanine-nucleotide-binding protein, 
potassium-voltage-gated-channel protein Shaker, the glutamate recep- 
tor for umami taste and protein Prospero** (Supplementary Table 16 
and Supplementary Fig. 15a). Notably, the genome encodes most com- 
ponents of four of the five human gustatory sensation pathways: the 
salty, sour, sweet and umami tastes. We also found several potential 
sensors for sound perception, a common characteristic of vertebrates 
and arthropods”, in the genome (Supplementary Table 17). 

We discovered an apparently intact olfaction pathway, including 
cyclic nucleotide-gated olfactory channel, guanine-nucleotide- 
binding protein and adenylyl cyclase type 3 (Supplementary 
Table 18 and Supplementary Fig. 15b). Moreover, mechanosensory 
perception mediated by mechanically gated ion channels represents 
the basis for the sensing of touch, balance, temperature and sound, 
and contributes essentially to the development and homeostasis of all 
Eumetazoa*’ (Supplementary Table 19). Putative sensory compo- 
nents for equilibrium/balance, mechanical stimulation, pain 
and temperature (Supplementary Tables 20 and 21) were also 
found in the S. japonicum genome, including two proteins that have 
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phosphorylation on the proteins indicated; BMP, bone morphogenetic 
protein; IGF, insulin-like growth factor; mGlu, metabotropic glutamate; 
GlyR, glycine receptor; GluR, glutamate receptor; HR, histamine receptor; 
GABA, y-aminobutyric acid; DR, dopamine receptor; 5-HT, 
5-hydroxytryptamine; HTR, 5-hydroxytryptamine receptor; NPYR, 
neuropeptide Y receptor; AChR, acetylcholine receptor; RyR, ryanodine 
receptor; OAR, octopamine receptor; ZIC2, Zic family member 2; CI, 
cubitus interruptus; suffix ‘R’ denotes receptor. 


similarities with the well-known mechanosensory protein, transient 
receptor potential cation channel’, and several receptors such as 
metabotropic glutamate receptor 3, which participate in the sensory 
perception of pain, light and taste. 

Neuroendocrine system. Schistosomes have receptors that apparently 
evolved to accept endogenous hormones as well as those of the para- 
sitized mammalian host****. By surveying hormones and receptors 
related to the classical neuroendocrine axis in the genomic sequence 
of S. japonicum, we found (Fig. 4) putative receptors for hypothalamic 
hormones such as thyrotropin-releasing hormone (TRH), prolactin- 
releasing hormone, somatostatin, melanin-concentrating hormone and 
leptin, as well as transmembrane proteins that have some similarities 
with receptors for gonadotropin-releasing hormone, corticotropin- 
releasing hormone and growth-hormone-releasing hormone. More- 
over, putative receptors are present that show weak similarity with those 
in mammals for the pituitary hormones thyroid-stimulating hormone 
(TSH), luteinizing hormone, follicle-stimulating hormone, arginine 
vasopressin and oxytocin. 

Although a hypothalamus-pituitary-like organ has not been 
described in schistosomes, it is possible that some neurons, similar 
to those in the hypothalamus and pituitary of vertebrates, could fulfil 
similar functions in terms of modulating the behaviour of 
S. japonicum through peripheral endocrine tissues and cells. In this 
regard, it is noteworthy that the genomic information suggests the 
presence of an integral hypothalamic—pituitary—thyroid axis in 
S. japonicum. In addition to the superior TRH-TSH receptors, an 
intact system for synthesis of thyroxine and active triiodothyronine, 
as well as an inactivation mechanism of these hormones using deio- 
dination, was identified. Nuclear receptors for triiodothyronine and 
thyroxine were revealed with identity to mammalian orthologues. 
Hence, S. japonicum may use an endogenous thyroid hormone/ 
receptor signalling pathway for growth and development (Fig. 4 
and Supplementary Table 22). 

We confirmed that S. japonicum has receptors for steroid hor- 
mones such as progestin, progesterone and oestrogen**”’. In addi- 
tion, it possesses intricate pathways for processing steroid hormones 
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Figure 4 | Putative neuroendocrine system in S. japonicum. Structured 
according to the proposed hypothalamus—pituitary—peripheral-endocrine- 
glands axis with putative ligands found in S. japonicum coloured in orange 
and S. japonicum receptors in yellow. CRH, corticotrophin-releasing 
hormone; GHRH, growth-hormone-releasing hormone; TRH, thyrotropin- 
releasing hormone; PRH, prolactin-releasing hormone; GnRH, 
gonadotropin-releasing hormone; TSH, thyroid-stimulating hormone; FSH, 
follicle-stimulating hormone; LH, luteinizing hormone; suffix ‘R’ denotes 
receptor. 


to form other sex hormones. For example, there are putative enzymes 
present that could convert the female hormones progesterone and 
pregnenolone to estriol, oestrone, androsterone and testosterone. 
Hence, schistosomes might use these pathways during their parasitic 
existence. Schistosoma japonicum also encodes enzymes to catabolize 
excessive or used steroid hormones such as aldosterone. 
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Figure 5 | S. japonicum proteases and elastase. a, The pie chart shows the 
distribution of the five kinds of protease. b, The genomic structure of 

S. japonicum cercarial elastase (SjCE). c, A phylogeny of the elastase family 
in schistosomes using the neighbour-joining method. Bootstrap values are 
provided above the branches. SmCE, S. mansoni cercarial elastase; ShCE, 
S. haematobium cercarial elastase; SdCE, Schistosomatium douthitti elastase. 
d, Immunofluorescence assay showing the presence (white arrow) of SjCE 
around a schistosomulum following its penetration through mouse skin 
(panel 2). A naive rabbit serum was used as negative control (panel 4). The 
location of the cercaria is indicated (white arrow). Panels 1 and 3 show the 
skin tissue slices under the optical microscope. 
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With regard to the process of glycolysis for essential energy supply, 
receptors for adiponectin, an insulin-sensitizing hormone™, and lep- 
tin, a suppressor of the secretion of insulin”, are also encoded by the 
genome of S. japonicum (Supplementary Table 22), providing further 
support for the notion that the blood fluke modulates its energy 
metabolism in response to either its own insulin-like hormones or 
those of its mammalian host. 

The schistosomulum renews its tegument during maturation into 
an adult schistosome under the effects of ecdysone**”’. In concordance, 
we identified an ecdysone-like receptor and its downstream effector 
ecdysone-induced protein 78C. In addition, allatostatin, a polypeptide 
hormone that suppresses the secretion of juvenile hormone, was previ- 
ously reported to be found throughout the schistosome nervous 
system***’. An allatostatin-like receptor sequence that has high 
similarity with that of the cockroach was also identified (Supplemen- 
tary Table 22). 


Disease pathogenesis 
Cercarial elastase and protease superfamily. Schistosome proteases 
have key roles in invasion*’, migration*' and feeding/nutrition’. We 
identified 314 putative proteases, including metallo-, cysteine, serine, 
threonine and aspartic proteases, in the S. japonicum genome data set 
(Fig. 5a and Supplementary Tables 23-27) by searching in the 
MEROPS database of peptidases. We classified 108 S. japonicum 
metalloproteases into 21 subtypes, 16 belonging to the aminopepti- 
dases (Supplementary Table 23). Notably, the leucine aminopeptidase 
of the M17 family was reported as a major egg antigen**** and a 
putative anti-fluke vaccine’. The second largest assemblage com- 
prised the cysteine proteases, of which 102 members were assigned 
to 17 subtypes (Supplementary Table 24). Among them, the cathe- 
psins B, C, F and L have pivotal roles in schistosome feeding and 
nutrition’, as well as in migration through human tissues*'. The 
cysteine proteases cathepsins K and S, as well as the cathepsin A serine 
protease, have not previously been recognized in schistosomes, and 
may contribute to catabolism of haemoglobin and other host proteins. 
Among the 65 serine proteases (Supplementary Table 25), we dis- 
covered a S. japonicum cercarial elastase (SjCE), an enzyme that in 
S. mansont is vital in the penetration by cercariae of mammalian skin 
to initiate infection*®**. The elastase locus predicted from the 
S. japonicum genome spans three exons and two introns, similar to 
the known S. mansoni elastases*’ (Fig. 5b); however, unlike for 
S. mansoni, only a single elastase was identified in S. japonicum. 
Phylogenetic analysis of available schistosome elastases (Supplemen- 
tary Table 28) suggested that the elastase genes in S. mansoni have 
expanded through at least two rounds of gene duplication, whereas 
SjCE is an orthologue of S. mansoni cercarial elastase 2b (Fig. 5c). 
Moreover, by re-examination of mass spectra data that we collected 
previously**, we identified a unique peptide (IAFLALSDFDHR) of 
SjCE in cercariae (Supplementary Fig. 16a). We also confirmed the 
existence of SjCE gene products in both the sporocyst and cercarial 
stages of S. japonicum by immunoblot and immunofluorescence 
assays (Fig. 5d and Supplementary Fig. 16b). In addition, the native 
protease was recognized by anti-recombinant SjCE antibodies in 
infected mouse skin, indicating that this cercarial elastase is 
secreted/released by the parasite during invasion of mammalian skin. 
Immune system and inflammatory factors. The immune system of 
S. japonicum has to face both invading microbial pathogens and the 
immune statuses of both its molluscan and mammalian hosts. 
Although adaptive immune molecules such as immunoglobulin are 
lacking in S. japonicum and a classical Toll-like receptor was not 
found, putative Toll-interacting protein or proteins containing 
Toll/interleukin-1 resistance motif or leucine-rich repeats appear 
to be present (Supplementary Table 29). Therefore, schistosomes, 
like nematodes, appear to possess a primordial Toll pathway as a first 
line of defence against microbial infections. The identification of the 
downstream components of a Toll-related pathway, including puta- 
tive interleukin-1-receptor-associated kinases, toll-like receptor 
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adaptors, TNF-receptor-associated factor 6 (TRAF6), inhibitor of 
nuclear factor kB kinase subunit epsilon (IKK-e) and p38 MAPK, 
further support the view that this primitive innate immune system 
could be crucial for the worm (Supplementary Table 30). 

On the other hand, factors and metabolites in S. japonicum that 
could contribute to stimulation and regulation of mammalian 
immunity were discovered. It is well accepted that glycans and lipids 
synthesized by adult schistosomes or eggs may regulate secondary 
signals through corresponding receptors on effector cells and access- 
ory cells of the mammalian host, thus compromising host immuno- 
logical defences targeting the parasite. We therefore searched for 
enzymes involved in the metabolism of various glycans or lipid anti- 
gens by interrogating this worm genome. It turned out that, with the 
rare exception of enzymes such as o1,3-mannosyltransferase, a com- 
plete set of enzymatic machinery for biosynthesis and modification of 
glycans and lipids exists (Supplementary Table 31). 

In addition, prostaglandins, which are well-known mediators of 
inflammation, can be synthesized by S. japonicum as a result of ara- 
chidonic-acid metabolism. It is feasible that S. japonicum synthesizes 
arachidonate by using lecithin, converting the arachidonate into leu- 
kotriene A4 using arachidonate 5-lipoxygenase, followed by the con- 
version of unstable leukotriene A4 into the active chemical 
leukotriene B4 through leukotriene A4 hydrolase. The S. japonicum 
genome also encodes putative receptors for leukotriene B4, cysteinyl 
leukotriene and prostaglandins E2 and F2, suggesting that prosta- 
glandins could have an important role in the physiology of schisto- 
somes and also in the host—parasite interplay. Unexpectedly, 
S. japonicum possesses proteins paralogous to mammalian auto- 
immune-disease-related autoantigens; these include 69 kDa islet cell 
autoantigen (ICA1), islet antigen-2 (PTPRN) and glutamate decar- 
boxylase (GAD), known autoantigens related to type-I diabetes in 
B-cells, which raises the possibility that these autoantigen-mimicking 
molecules could induce chemokine-receptor-mediated cell migra- 
tion and initiate leukocyte migration into inflamed tissue, which 
ultimately contribute to the granuloma formation that promotes 
parasite survival. 


Concluding remarks 


Lophotrochozoa, of which S. japonicum is a member, is a large taxon 
that includes ~50% of all animal phyla. Our work provides a model 
for evaluating the genomic architecture, biology and evolution in this 
major taxon. Although the genome of S. japonicum has undergone 
significant protein-domain-loss events, a detailed molecular rep- 
ertoire exists to permit the pathogen to locate and penetrate hosts, 
nourish itself and interact with the environment and its host. With 
the release and analysis of the S. mansoni genome“, a comparative- 
genomics approach elucidating the similarities and differences 
between these two closely related parasites will provide more clues 
regarding these important pathways. Further functional analysis, 
using approaches such as RNA interference and translational studies 
are essential to resolve uncertainties in the molecular physiology of 
schistosomes and to illuminate mechanisms of pathogenesis in schis- 
tosomiasis, efforts that may lead to the development of new inter- 
ventions for its control and eventual elimination. 


METHODS SUMMARY 


We obtained adult worms and eggs of S. japonicum from infected rabbits. The 
genomic DNA was extracted from ~ 1,000 mixed, outbred adult male and female 
S. japonicum, perfused from rabbits infected with cercariae released by naturally 
infected snails. Genomic libraries, including bacterial artificial chromosome 
(BAC), fosmid and plasmid libraries, were constructed. We performed WGS 
sequencing on capillary sequencers, and then used a modified PHUSION (ver- 
sion 2.1c) package to assemble the reads. Protein-encoding genes were predicted 
using EXONHUNTER (version 2.0)*’. We used a stepwise method to predict the 
gene functions. The metabolic and regulatory pathway of S. japonicum was 
reconstructed with reference to the KEGG pathway database. Proteins were first 
clustered using a Markov cluster algorithm and then merged according to 
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protein-domain information to establish protein-family clusters. We used 
immunoblot and immunofluorescence assays to detect cercarial elastase. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Schistosoma japonicum genomic and full-length cDNA library construction. 
Genomic DNA was extracted from ~1,000 mixed, outbred adult male and 
female S. japonicum, perfused from rabbits infected with cercariae released by 
naturally infected snails collected from an endemic focus in Anhui Province, as 
described”’. Four genomic libraries with different insert sizes were constructed, 
one of bacterial artificial chromosomes (inserts, 80-120 kb), one of fosmids (36- 
42 kb) and two of plasmids (6-10 kb and 1.6-4kb) (Supplementary Table 1). 
Total RNAs from S. japonicum adults and eggs were isolated using Trizol 
(Invitrogen), after which mRNA was purified using the Poly(A) Purist mRNA 
Purification Kit (Ambion). Two full-length cDNA libraries, from adults and eggs 
were constructed using a modified biotinylated CAP-trapper approach™”?. 
WGS sequencing and assembly. After the clone ends of four discrete genomic 
libraries were sequenced by capillary DNA sequencers ABI3700 (Applied 
Biosystems) and MegaBACE 1000 or MegaBACE 4000 (General Electric), 
PHRED (version 0.020425.c)**” was used for base calling. All reads were qualified 
by removing clone vector and bacterial host sequences, as well as the host rabbit 
(Oryctolagus cuniculus) DNA sequences (http://www.ensembl.org/Oryctolagus_ 
cuniculus/index.html). A modified PHUSION (version 2.1c) package”® was used 
for assembly. 

Repeat and retrotransposon identification. A repetitive sequence library of 
S. japonicum was generated by the method of consensus seed extending using 
REPEATSCOUT (version 1.0.3)", with the k-mer size of 16. Tandem repeats in 
the genome were identified using TANDEM REPEATS FINDER (version 4.00)°” 
and categorized using the tandem repeats analysis program TRAP (version 1.0)”. 
Microsatellites, minisatellites and satellites are classically defined as repeat units of 
1-6 bp, 11-100 bp and more than 100 bp, respectively. Polyprotein and reverse 
transcriptase from GenBank were used as queries to search genome sequences of 
S. japonicum using tBLASTN (e-value = 10° '°). The best hit sequences were then 
used to query the genome, and those yielding multiple hits in the genome were 
categorized as candidate retrotransposons. All candidate retrotransposons were 
assembled to establish complete CDSs encoding polyprotein or reverse transcrip- 
tase. Once the complete CDS was determined, sequences upstream and down- 
stream of this CDS in the genome were analysed to identify LTRs which flank the 
left and right termini of LTR retrotransposons and retroviruses. 

Prediction and integration of protein-coding genes. Protein encoding genes 
were predicted using EXONHUNTER (version 2.0)”. The prediction program 
combined ab initio gene prediction with supporting evidence from S. japonicum 
and S. mansoni expressed sequence tags, S. japonicum pair-end ditags, the Swiss- 
Prot protein database” and the Pfam protein-domain database (version 22.0). 
Because there were few training sets available for S. japonicum or for any other 
closely related species, we developed an iterative method that started from the 
distantly related species C. elegans, and progressively improved parameters of the 
gene finder on the basis of well-supported predicted gene fragments. The pre- 
dicted genes were merged with putative expressed sequence tags and full-length 
cDNA-derived CDSs (proteins), yielding an integrated protein-coding gene set 
for further functional analysis. These genes were classified into categories estab- 
lished by the Gene Ontology project through the encoding proteins or domains 
matched to the Gene Ontology index provided by UniProt® and InterPro” 
(iprscan_DATA_17.0 and iprscan_PTHR_DATA_ 14.0). 

Genome variation analysis. The PHUSION assembler® does not provide align- 
ment information of reads to its contig consensus, so BLASTN was used to 
relocate reads to contig consensus, with overall identity of over 95%, and to 
provide alignment information. We established a locally developed SNP pipeline 
based on neighbourhood quality standard, with the following rules: for each 
candidate SNP on shotgun reads, the 5-bp flanking sequences should be the same 
as the contig consensus, the base quality on the SNP site should be no less than 23 
and the base quality of the flanking 5 bp should be not less than 15 (refs 63, 64). 
Pathway mapping. The metabolic and regulatory pathway of S. japonicum was 
reconstructed on the basis of the KEGG pathway database®. The KEGG orthol- 
ogy identifier was used as a linkage between genes and pathways. The assignment 
of S. japonicum genes to KEGG orthologues was implemented with a modified 
bidirectional-best-BLAST-hits method, which was adjusted using phylogenetic 
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information. The pathway mapping results for the S. japonicum genome are 
available at http://chgc.sh.cn/japonicum. 

Gene-family analysis. Proteins of S. japonicum, C. elegans, D. melanogaster, 
A. gambiae, D. rerio, G. gallus, H. sapiens and N. vectensis were first clustered 
using a Markov cluster algorithm® and then merged according to protein- 
domain information to establish protein-family clusters. The S. japonicum 
protein domains were scanned using INTERPROSCAN®. Protein-domain 
information on other species was sourced from the KEGG database”. 
Analysis of S. japonicum proteases. Putative proteases in the S. japonicum data 
set were identified by comparing S. japonicum cDNA and predicted genes with 
the MEROPS database‘’. The results were manually checked and compared with 
annotations generated by BLAST searches against more comprehensive data- 
bases as above. Results with inconsistent annotations from MEROPS and BLAST 
were removed. For phylogenetic and evolutionary analyses of gene families, 
deduced amino-acid sequences were aligned using CLUSTALW (version 
1.83)°*. Phylogenetic trees were generated using MEGA (version 3.1)” with 
the neighbour-joining method and tested with 1,000 bootstrap replicates. 
Immunofluorescence assay of S. japonicum cercarial elastase. A mouse anaes- 
thetized with pentobarbital was infected with S. japonicum cercariae. After 
10 min, the skin was excised, finely diced, and embedded in OCT fixative. The 
prepared 7-\1m-thick frozen sections were incubated for 30 min in a solution of 
20% goat serum in Tris-HCl-buffered saline. The sections were incubated with 
the rabbit primary antiserum raised against purified recombinant SjCE or nor- 
mal rabbit serum, followed by a FITC-conjugated second antibody. Fluorescence 
was visualized using a Leica DM-2500 fluorescence microscope. 
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Schistosoma mansoni is responsible for the neglected tropical disease schistosomiasis that affects 210 million people in 76 
countries. Here we present analysis of the 363 megabase nuclear genome of the blood fluke. It encodes at least 11,809 genes, 
with an unusual intron size distribution, and new families of micro-exon genes that undergo frequent alternative splicing. As 
the first sequenced flatworm, and a representative of the Lophotrochozoa, it offers insights into early events in the evolution 
of the animals, including the development of a body pattern with bilateral symmetry, and the development of tissues into 
organs. Our analysis has been informed by the need to find new drug targets. The deficits in lipid metabolism that make 
schistosomes dependent on the host are revealed, and the identification of membrane receptors, ion channels and more than 
300 proteases provide new insights into the biology of the life cycle and new targets. Bioinformatics approaches have 
identified metabolic chokepoints, and a chemogenomic screen has pinpointed schistosome proteins for which existing drugs 
may be active. The information generated provides an invaluable resource for the research community to develop much 
needed new control tools for the treatment and eradication of this important and neglected disease. 


Schistosomiasis is a neglected tropical disease that ranks with 
malaria and tuberculosis as a major source of morbidity affecting 
approximately 210 million people in 76 countries, despite strenuous 
control efforts’. It is caused by blood flukes of the genus Schistosoma 
(phylum Platyhelminthes), which exhibit dioecy and have complex 
life cycles comprising several morphologically distinct phenotypes in 
definitive human and intermediate snail hosts. Schistosoma mansoni, 
one of the three major human species, occurs across much of 
sub-Saharan Africa, parts of the Middle East, Brazil, Venezuela and 
some West Indian islands. The mature flukes dwell in the human 
portal vasculature, depositing eggs in the intestinal wall that either 


pass to the gut lumen and are voided in the faeces, or travel to the liver 
where they trigger immune-mediated granuloma formation and 
peri-portal fibrosis’. Approximately 280,000 deaths per annum are 
attributable to schistosomiasis in sub-Saharan Africa alone’. 
However, the disease is better known for its chronicity and debilitat- 
ing morbidity*. A single drug, praziquantel, is almost exclusively used 
to treat the infection but this does not prevent reinfection, and with 
the large-scale control programmes in place, there is concern about 
the development of drug resistance. Indeed, resistance can be selected 
for in the laboratory and there are reports of increased drug tolerance 
in the field’. 
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In this study we present the sequence and analysis of the S. mansoni 
genome. Previous metazoan projects have been restricted to 
Deuterostomia (for example, Homo, Mus and Ciona) and the ecdy- 
sozoan clade of the Protostomia (for example, Drosophila, 
Caenorhabditis and Brugia). Together with the accompanying article 
on S. japonicum’, we present, to our knowledge, the first descriptions 
of metazoan genomes from the lophotrochozoan clade. The genome 
reveals features that aid our understanding of the evolution of com- 
plex body plans. We have mined the genome to predict new drug 
targets, on the basis of searches involving traditional areas for drug 
discovery, metabolic reconstruction, and bioinformatics screens that 
exploit shared pharmacology. It is hoped that these and other targets 
will accelerate drug discovery, generating the much needed new treat- 
ments for the control and eradication of schistosomiasis. 


Genome structure and content 


The nuclear genome sequence of S. mansoni was determined by whole- 
genome shotgun sequencing and assembled into 5,745 scaffolds greater 
than 2 kilobases (kb) (Supplementary Table 1), totalling 363 mega- 
bases (Mb). Although 40% of the genome is repetitive, 50% is 
assembled into scaffolds of at least 824.5kb. Furthermore, 43% of 
the genome assembly (distributed over 153 scaffolds) was unambi- 
guously assigned to chromosomes (seven autosomal, plus ZW sex- 
determination pairs) using fluorescence in situ hybridization (FISH; 
Fig. 1, Supplementary Fig. 1 and Supplementary Table 2). 

We identified 72 families of both long-terminal repeat (LTR) and 
non-LTR transposons, comprising 15% and 5% of the genome, 


ARTICLES 


respectively, and containing 63 and 60 new families each (Sup- 
plementary Table 3). The LTR transposons are from the Ty3/Gypsy 
and BEL clades, whereas the non-LTR transposons are restricted to 
the RTE, CR1 and R2 clades. Two previously described non-LTR 
retrotransposon families from the RTE clade (SR2 and Perere-3)”* 
seem to have undergone a burst of transposition events after diver- 
gence of S. mansoni and S. japonicum, and contribute to an overall 
higher representation of non-LTR retrotransposons in S. mansoni 
(15%, around 8% in S. japonicum). A new DNA transposon belong- 
ing to the Mu family was also found, which represents the first 
instance in a flatworm. The presence of target site duplications in 
some copies indicates recent transposition, and suggests that active 
copies may still exist in the genome. A lack of terminal inverted 
repeats, a feature of Mu family members, suggests a peculiar mech- 
anism for recognition of this element by the transposition apparatus. 

We identified 11,809 putative genes encoding 13,197 transcripts. 
Considering genes that do not span a gap, the average gene size is 
4.7kb, typically with large introns (the average is 1,692 base pairs 
(bp)) and much smaller exons (the average is 217 bp). Moreover, 
the introns show a markedly skewed size distribution that has not 
been observed in other eukaryotes, whereby 5’ introns are smaller 
than 3’ introns (Fig. 2, Supplementary Information and Supplemen- 
tary Table 5). In multi-exon genes, the first few introns can be as small 
as 26 bp, whereas introns towards the 3’ end are typically kilobases in 
length (the largest is 33.8 kb). The reason for this is unclear but it 
suggests unusual transcriptional control. However, a survey of 
conserved transcription factor domains shows S. mansoni to be 
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Figure 1| Physical map of S. mansoni. a, b, Idiogram of S. mansoni 
chromosomes W, Z (a) and 3 (b). S. mansoni BAC clones were mapped to the 
karyotype of S. mansoni by FISH. The solid black areas are heterochromatin, 
the open areas are euchromatin. The BAC clones are identified by BAC 
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3/15B2 
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numbers. c-f, Chromosome spreads with FISH-mapped BACS are shown. 

FISH-mapped BACS are identified by arrowheads on labelled chromosomes. 
Scale bar, 10 tm. See Supplementary Fig. 1 for idiograms of all S. mansoni 
chromosomes. 
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Figure 2 | Intron size distribution. The length of introns varies according to 
their position in a transcript, counting from the 5’ end (solid circles) and the 
3’ end (open circles). Mean lengths + standard errors are shown. After about 
five introns, the length difference is no longer apparent owing to the 
variation in the number of introns per transcript (see Supplementary 
Information). 


broadly similar to other eukaryotes (Supplementary Information, 
Supplementary Fig. 2 and Supplementary Table 6). It is note- 
worthy that 43% of transcription factor families with schistosome 
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representatives also contained vertebrate sequences, nearly twice the 
number that matched nematode worms, emphasizing their evolu- 
tionary distance. 


Micro-exon genes 

At least 45 genes have an unusual micro-exon structure. Individual 
micro-exons have been described in other genomes, dispersed among 
several normal exons’. However, S. mansoni is notable in containing 
micro-exon genes (MEGs) that comprise 75% of the coding sequence, 
are flanked at the 5’ and 3’ extremes by conventional exons, and have 
lengths that are multiples of three bases (from 6 to 36). 

Other than having shared gene structure, no similarity could be 
detected between 14 MEG families (each with up to 23 members; 
Fig. 3 and Supplementary Table 7). Moreover, they showed no 
similarity with annotated genes from outside Schistosoma spp., nor 
any identifiable motifs or functional domains. Comparisons between 
MEG family members and related proteins from S. japonicum suggest 
that some gene duplication events preceded the divergence of the two 
species. Almost all encode a signal peptide at the 5’ end and three 
have membrane anchors, so most are probably secreted. Examination 
of the large expressed-sequence tag (EST) data set from across the life 
cycle shows that genes from all MEG families are transcribed in the 
intramammalian stages of the life cycle, and the germ balls of 
daughter sporocysts that develop into infective cercariae, but prob- 
ably not in miracidia that infect the snail intermediate host (Fig. 3). 

Sequencing of transcripts from three MEG families revealed the 
occurrence of several alternative splice variants formed by exon skip- 
ping. In one of the families analysed, all internal exons except those 
coding for the signal peptide were missing in at least one transcript 
sampled, and a gene from a second family presented different 
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Figure 3 | Schematic representation of gene structure from MEG family 
members. a, Structure of a representative member from each MEG family. 
Where several members were found, the total number detected is indicated 
in parentheses. Each box represents an exon drawn to scale, and the number 
above it indicates the exon size in nucleotides. For illustrative purposes, the 
introns are shown with fixed length. Black triangles and diamonds indicate 
exons encoding predicted signal peptides and transmembrane helices, 
respectively. Other characteristics associated with exons are indicated by 
colour and grouped as follow: micro-exons having lengths that either are 
multiples of 3 bp (red) or are indivisible by 3 bp (orange); exons longer than 
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36 bp and having lengths that either are multiples of 3 bp (blue) or are 
indivisible by 3 bp (green); putative initiation and termination exons (grey); 
untranslated region (UTR) (black). The asterisk indicates an exon deduced 
from transcript data, which did not match the sequenced genome. MEG-12 
and MEG-13 structures were only partially predicted owing to the lack of 
transcripts containing the 5’ end of these genes. b, PCR with reverse 
transcription (RT—PCR) or EST-based evidence of transcription (black box) 
for each family across different life-cycle stages. C, cercaria; E, egg; G, 
germball; M, miracidium; 3s and 7s, 3- and 7-day schistosomula; 211i and 
28li, 21- and 28-day liver worms; 45a, 45-day adult worm pairs. 
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transcripts with extended exons produced by the use of alternative 
splicing sites. These observations suggest that a ‘pick and mix’ strat- 
egy is used to create protein variation 


Evolution of triploblasty, parasitism and tissues 


Schistosomes are the first Platyhelminthes to be fully sequenced, and 
provide insights into the evolution of ‘simple’ animals. Using Treefam 
to make comparisons with the sea anemone Nematostella vectensis, a 
representative of the Radiata, we sought gene families restricted to, or 
expanded in, the Bilateria (Supplementary Table 8). The advent of a 
third germ layer in flatworms is paralleled by the expansion of genes 
encoding cell adhesion molecules such as cadherins. Similarly, tissue- 
patterning developmental cues (for example, Notch/Delta) and 
histone-modifying enzymes (for example, histone acetyltransferases) 
have proliferated. Some genes, such as the tetraspanins that encode 
membrane structural proteins, have greatly proliferated in schisto- 
somes, suggesting a critical role in worm physiology/parasitism. The 
large array of paralogues for fucosyl and xylosyltransferases involved 
in the generation of new glycans expressed at the host—parasite inter- 
face may be important for subverting the immune system. The expan- 
sion of proteases in schistosomes also seems to be directly related to 
parasitism, because it includes families involved in host invasion 
(invadolysins) and blood feeding (cathepsins). Furthermore, 
G-protein-coupled receptors (GPCRs) show varying levels of contrac- 
tion in schistosomes, whereas several classes (for example, peropsins) 
are greatly expanded in Nematostella, indicating functions associated 
with the free-living lifestyle. 

Although schistosomes are acoelomate, they possess tissues 
approaching the sophistication of organs—such as gut, nephridia, 
nerve and muscle—that are concerned with discrete physiological 
processes, such as feeding, excretion and locomotion. However, as 
lophotrochozoans they are evolutionarily distant from the previously 
sequenced parasitic nematodes Brugia'® and Meloidogyne'’”? (both 
ecdysozoans). Compartmentalisation of schistosome tissues and 
the formation of epithelial barriers are crucial for life in the hostile 
environment of the host bloodstream. Schistosomes possess the 
typical machinery of higher metazoa to interact with the cytoskeleton 
and control cell polarity (Supplementary Information and Sup- 
plementary Table 9), organize epithelia and denote tissue boundary 
lines. 

S. mansoni possess a nervous system that includes an anterior brain 
and longitudinal nerve cords, which extend from the brain to run the 
length of the worm body. Furthermore, a variety of sensory structures 
(at least six types in the cercaria'’) are able to transduce a wide range 
of stimuli that assist in host location, penetration and navigation 
through the vasculature. In common with more complex organisms, 
schistosomes possess the tools needed to mediate neurogenesis and 
control axon growth cones and migration of neural cells (Sup- 
plementary Information and Supplementary Table 9), supporting 
the ancient origins of neural complexity. 


Insights into possible new drug targets 

Historically, anti-schistosomiasis agents were identified by in vivo 
screening in animal models. The S. mansoni genome project makes a 
more target-based approach to drug discovery feasible, and some 
promising leads have already emerged. These include a family of 
nuclear receptors'* (Supplementary Information) and a redox enzyme, 
thioredoxin glutathione reductase, recently validated as a drug target’”. 
The condensed redox biochemistry of S. mansoni, relative to its human 
host, may offer further drug development targets (Supplementary 
Information). In the context of drug discovery, we have explored other 
potential areas of vulnerability, including: lipid metabolism, GPCRs, 
ligand- and voltage-gated ion channels, kinases, proteases and neuro- 
peptides. We also undertook two bioinformatics-led approaches: 
metabolic reconstruction to identify chokepoints, and sequence 
searches for structures related to known drug targets. 
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Lipid metabolism 

S. mansoni contains a full complement of genes required for most core 
metabolic processes, such as glycolysis, tricarboxylic acid cycle and the 
pentose phosphate pathway. However, schistosomes are incapable of 
de novo synthesis of sterols or free fatty acids and must use complex 
precursors from the host'’. An extensive lipid-carrying protein 
repertoire could be identified, but despite producing precursors for 
fatty acid synthesis, fatty acid synthase could not be identified. An 
inability to use isoprene products of the mevalonate pathway probably 
accounts for the lack of sterol biosynthesis (Supplementary Table 11 
and Supplementary Information). The genes necessary for a complete 
B-oxidation pathway are present, and this usually inactive pathway 
might operate in reverse to perform syntheses'’. Despite constituting 
40% or more of the lipid content of adult worms’*, triacylglycerol has 
an uncertain role in the schistosome’s life cycle—it is slow to turn 
over, does not contribute to the formation of other lipids’® and its use 
as an energy store is doubtful’’. Nevertheless, S. mansoni possesses 
lipases capable of breaking down triacylglycerol, so it may have func- 
tions other than preventing too high concentrations of intracellular 
fatty acids'®. Pathways responsible for synthesizing the phospholipid 
components of membranes are well represented, except that phospha- 
tidylcholine must be derived from diacylglycerol'® and the parasite 
must depend on its host as a source of inositol. 


GPCRs, ligand-gated and voltage-gated ion channels 

GPCRs, ligand-gated and voltage-gated ion channels are targets for 
50% of all current pharmaceuticals’. At least 92 putative GPCR- 
encoding genes are present (Supplementary Table 12), the bulk 
(82) of which are from the rhodopsin family. The largest groups 
are the o-subfamily (30), which includes amine receptors, and the 
B-subfamily (24), which contains neuropeptide and hormone recep- 
tors. The diversity of the former subfamily underlines the wide range 
of potential amine/neurotransmitter reactivities of schistosomes, but 
the tentative identities assigned need to be confirmed by functional 
studies, as has already been performed for a histamine receptor”. 
Schistosomes detect chemosensory cues, but a large, unique clade 
of the mediating receptors was not found. However, the 26 ‘orphan’ 
rhodopsin family GPCRs may include proteins with this role. 
Outside the large rhodopsin family, representatives from each of 
the smaller families of GPCRs, glutamate family (2), frizzled family 
(3), and the secretin/adhesion family (4) are present. 

Each of the three major ligand-gated ion channel families—the 
Cys-loop family, glutamate-activated cation channels, and ATP- 
gated ion channels—are represented in the schistosome genome. 
Of the 13 Cys-loop family ligand-gated ion channels, nine encode 
nicotinic acetylcholine receptor subunits (Supplementary Fig. 4 and 
Supplementary Table 13). The remaining four anion channel sub- 
units group among GABA (y-aminobutyric acid), glycine and glu- 
tamate receptors, but it is not possible to assign precise identities. 
The seven schistosome glutamate-activated cation channels com- 
prise at least two sequences from each of the three common sub- 
groupings. The presence of a functional P2X receptor for ATP- 
mediated signalling in schistosomes was already known”', and the 
data here show at least four more. 

Voltage-gated ion channels generate and control membrane poten- 
tial in excitable cells, and are central to ionic homeostasis. There are 
examples of successful drugs targeting voltage-gated sodium, 
potassium and calcium channels”. Although voltage-gated sodium 
channels were not found, at least 41 members from each of the major 
six transmembrane (6TM) and four transmembrane (4TM) families 
of potassium channels (Supplementary Table 14) are present. The 
61M voltage-gated potassium channel family (20 members) is the 
largest, including the well-characterized Kv1.1 channel found in nerve 
and muscle of adult schsitosomes”’. Other classes of 6TM potassium 
channels include the KQT channels, large calcium-activated channels, 
small calcium-activated channels, and cyclic-nucleotide-gated groups. 
This last group, comprising eight members, is most often associated 
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with signal transduction in primary olfactory and visual sensory cells 
(Caenorhabditis elegans has only five; ref. 24). S. mansoni possesses six 
4TM inward-rectifying TWIK-related potassium channels (about 46 
in C. elegans). There are four « and two B subunits of voltage-gated 
calcium channels in schistosomes, and a B subunit is implicated as a 
molecular target of the anti-schistosomal praziquantel”. 


The kinome 


Protein kinases are important regulators of many different cellular 
functions. Both they and their inhibitors have entered the drug 
development pipeline in recent years”® but few schistosome kinases 
have been characterized to date. The S. mansoni genome encodes 249 
kinases, including 22 genes with alternative splicing (Supplementary 
Information). This corresponds to 1.9% of the total coding proteins 
in the genome, a figure comparable to that found in other species” 
(Supplementary Fig. 6). S. mansoni possesses representatives of all of 
the main kinase groups (Supplementary Fig. 7), the largest of which is 
the CMGC (cyclin-dependent kinases, mitogen-activated protein 
kinases, glycogen synthase kinase 3 and CK2-related kinases) group, 
in contrast to other analysed eukaryotic genomes. However, a single 
class (RCK) is absent from the CMGC family, a deficiency shared 
with yeast but not nematodes or mammals. 

The least represented groups are the casein kinase (CK1) and 
receptor guanylate cyclase families with only seven and three mem- 
bers, respectively, contrasting with C. elegans, in which casein kinase 
is the largest group and receptor guanylate cyclase has 27 members. 
CK1 (and CMGC) group members that are expressed in sperm or 
during spermatogenesis in C. elegans are missing in S. mansoni. 


The degradome 

Proteolytic enzymes (proteases), making up an organism’s “degra- 
dome’, operate in virtually every biological and pathological phe- 
nomenon” and are proven drug targets in diverse biomedical 
contexts*®*'. All five major classes of proteases (aspartic, cysteine, 
metallo-, serine and threonine) are represented as various clans 
(mechanistically related groups) in the parasite genome (Supplemen- 
tary Table 17). The percentage distribution of the major clans is 
generally similar to that of the human host with some notable excep- 
tions, mainly owing to the expansion of constituent protease families in 
humans. Of the 73 protease families, 61 are found in humans and in 
S. mansoni, and 60 families are shared. With 335 sequences, proteases 
comprise 2.5% of the putative proteome (Supplementary Table 18), 
consistent with the proportion in other organisms (1—5%), but this is 
only one-third of that in humans (945 sequences, if A2 family retro- 
virus and retrotransposon proteases are included). 

The greatest difference between host and parasite is in the paucity 
of chymotrypsin-like $1 family enzymes in the latter (22 versus 135 
human sequences). This reflects the evolution and diversification of 
family S1 for complex and highly regulated proteolysis cascades in 
vertebrates and some invertebrates, such as innate immunity, 
development, blood coagulation and complement activation”. 
From a therapeutic standpoint, the reduced complexity may prove 
valuable with fewer parasite proteases available for essential life- 
sustaining functions. For example, robust drug discovery 
programmes are in place for chymotrypsin-like $1 families and 
peptidase C14 (caspases)**, on which anti-schistosomal drug 
discovery could ‘piggy-back’””. It is also notable that a smaller num- 
ber of schistosome protease families (for example, C1, M8 and M13) 
have more members than the respective families in humans. Cl 
proteases are involved in nutrient digestion by the parasite, which 
contrasts with the S1 enzymes used in the host. This disparity has 
already been exploited for a promising anti-schistosome therapy”. 
One protease family (C83) is apparently unique to S. mansoni. 

Apart from the degradome, but involved in its modulation, 34 
protease inhibitors were found (Supplementary Table 19). Most of 
these are serine protease inhibitors belonging to families I2 (Kunitz- 
type) and I4 (serpins). Two inhibitors of cysteine proteases 
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(cystatins*”*°) and two o-2-macroglobulin homologues (139) were 


also identified, as were three inhibitor of apoptosis proteins (132), 
one of which is highly expressed in adults, where it may function to 
regulate one or more of the four schistosome caspases. 


Neuropeptides 


Thirteen putative neuropeptides were identified (Supplementary 
Table 20), indicating that schistosomes may have much greater 
diversity than the two described previously. Apart from the neuro- 
peptide Fs (NPFs), most are apparently restricted to the 
Platyhelminthes—their absence from humans making them a cred- 
ible source of anthelmintic drug leads. The predicted product of 
npp-6 (the amidated heptapeptide AVRLMRLamide) resembles 
molluscan myomodulin, whereas the two NPP-13 peptides show 
100% carboxy-terminal identity with vertebrate neuropeptide-FF- 
like peptides (peptides ending with a C-terminal sequence 
PQRFamide); neither of these has previously been reported in any 
non-vertebrate organism. The discovery of a second NPF (NPP-21b) 
as well as the known NPP-21a*' is reminiscent of the vertebrate 
neuropeptide Y (NPY) superfamily, and strengthens the argument 
that NPFs and NPYs have a common ancestry. 


Metabolic chokepoints 

A chokepoint analysis of metabolic pathways reconstructed from the S. 
mansoni genome was used to identify further targets. A total of 607 
enzymatic reactions could be placed in pathways, and 120 of these 
enzymes were identified as chokepoints (Supplementary Table 21). 
The list of chokepoints includes many that are drug targets in other 
organisms as wellas target reactions already characterized in S. mansoni, 
validating the approach (Supplementary Information). The list also 
contains new candidate targets and comprises approximately 1% of 
the S. mansoni proteome. 


Chemogenomics screening 


In the context of neglected tropical diseases and with constrained 
investment in drug discovery, piggy-backing” or ‘drug-repositioning’ 
strategies’ that re-use existing drugs offer potential time-saving and 
cost benefits. We adopted a twofold strategy to find significant matches 
between proteins from the parasite and known ‘druggable’ protein 
targets of the human host and human-infective pathogens. Using con- 
servative parameters of >50% sequence identity over >80% of the 
target, we first performed a similarity search against a database of 
targets curated from medicinal chemistry literature. This revealed 
240 distinct S. mansoni transcripts with matches to targets against 
which there are high quality compounds (Supplementary Table 22). 
Given the need for short-course, oral therapies against schistosomiasis, 
this list was further reduced to 94 S. mansoni targets by filtering for 
potency and predicted bioavailability. A second search, against a data- 
base of the targets for human-directed drugs, showed 66 significant 
matches with pharmaceuticals marketed at present (Supplementary 
Table 23), corresponding to 34 S. mansoni targets (26, after represent- 
ing multicopy genes as a single instance; Table 1). For instance, 
disulfiram, for controlling substance abuse, was highlighted as a poten- 
tial anti-schistosomal drug; its anti-parasite properties have already 
been investigated**. Manual inspection of the list for compounds with 
side effects and toxicity can further refine choices—for example, by 
eliminating the immunosuppressants, cyclosporin and rapamycin. 
The remaining known drugs could be directly tested in animal models, 
and either applied unmodified in anti-schistosomal therapy, or could 
serve as leads for further optimisation. Widening the search beyond 
the initial strict criteria would expand opportunities, for example, 
topoisomerase | is retrieved below our initial threshold, at 71% iden- 
tity but only 58% overlap. 


Conclusion 


Acentury after Louis Sambon first named the species in 1907 (ref. 44), 
the sequencing of the S. mansoni genome is a landmark event. The 
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Table 1| S. mansoni genes that match a human gene with marketed drugs 
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Gene identifier Protein description 


Potential drugs 


Smp_005210 Histone deacetylase 1 (HDAC1) 

Smp_009030 Ribonucleoside-diphosphate reductase, « subunit, putative 
Smp_012930 Inosine-5-monophosphate dehydrogenase, putative 
Smp_015020 Na*,K*-ATPase « subunit (SNaK1) 

Smp_016780* — Tubulin « chain, putative 

Smp_022960 Aldehyde dehydrogenase, putative 

Smp_026560 Calmodulin, putative 


Smp_030730* — Tubulin B chain, putative 


Vorinostatt 

Fludarabine phosphatet 

Mycophenolate mofetilt, mycophenolic acidf, ribavirin’ 

Digoxint, digitoxint, acetyldigitoxin8, deslanoside8 

Vinblastinet, colchicine, vincristinet 

Disulfiramt 

Bepridil8 

Colchicinet, vinblastinet, vincristinet, albendazolet, mebendazolet, 
paclitaxelt, thiabendazolet, vinorelbinet, docetaxel8 

Cyclosporinet 

Cyclosporinet 

Fomepizolet 

Auranofint 

Disulfiramt 

Tolrestatt 

Hydroxyureat, gemcitabinet 

Ciclopiroxt 

Cyclosporinet 

Pimecrolimust, temsirolimusf, sirolimust, tacrolimust 

Vorinostatt 

Cyclosporinet 

Rauwolfia serpentinat, reserpinet, deserpidine8, rescinnamine8, alseroxylon8 
Pemetrexedt, flucytosinet, floxuridinet, capecitabinet, gluorouracilt 
Dasatinibt 

Tiludronate8 

Carbidopat 

Cyclosporinet 


Gene identifier is the genome project systematic name for each gene. It corresponds to the locus tag in the DDBJ/EMBL/GenBank record and to the main accession numbers for GeneDB. 


Smp_040130 Cyclophilin (p17.7) 

Smp_040790 Cyclophilin B 

Smp_044440 Alcohol dehydrogenase, putative 

Smp_048430 Thioredoxin glutathione reductase 

Smp_050390 Aldehyde dehydrogenase, putative 

Smp_053220 Aldo-keto reductase, putative 

Smp_055890 Ribonucleoside-diphosphate reductase small chain, putative 
Smp_065120 Deoxyhypusine synthase, putative 

Smp_069160 Cyclophilin, putative 

Smp_079230 Immunophilin FK506 binding protein FKBP12, putative 
Smp_093280 Histone deacetylase 3 (HDAC3) 

Smp_094810 Cyclophilin E 

Smp_121920 Vesicular amine transporter, putative 

Smp_135460 Bifunctional dihydrofolate reductase-thymidylate synthase, putative 
Smp_136300 Tyrosine kinase 5 

Smp_147050 ATP synthase « subunit vacuolar, putative 

Smp_171580 Aromatic amino acid decarboxylase, putative 

Smp_173280 Cyclophilin, putative 

* There are several copies of 


ubulin (alpha, Smp_027920, Smp_090120 and Smp_103140; beta, Smp_192110, Smp_079960, Smp_079970, Smp_078040 and Smp_035760). 


The potential drugs are classified according to the confidence with which the efficacy of the drug in human can be attributed to the target. 


+ Direc 
£ Direc 


and clear evidence that this interaction is primarily responsible for the therapeutic action of the drug. 
and clear evidence that this interaction represents one mechanism for the drug, other targets/mechanisms may also exist. 


8 Indirect or inferred evidence of the association of the drug, target and therapeutic action, although the exact mechanism is still speculative. 


sequence provides the scientific community with several avenues to 
study this under-researched human pathogen, and will drive future 
evolutionary, genetic and functional genomic research. Not least, 
given that just one drug is widely available to treat schistosomiasis 
at present, the genome sequence, including the genome-mining ana- 
lysis presented, offers the possibility that new drug candidates will 
soon be identified. 


METHODS SUMMARY 


Mixed-sex cercariae from the Puerto Rico isolate of S. mansoni’, released from 
infected Biomphalaria glabrata snails, were placed in low-melting agarose plugs 
and genomic DNA was prepared by standard methods. Approximately sixfold 
coverage of the nuclear genome was obtained using a whole-genome shotgun 
sequencing approach, in which libraries of different cloned insert sizes (in plas- 
mid, fosmid and bacterial artificial chromosomes (BAC) vectors) were randomly 
sequenced by Sanger technology from either end. Sequence reads were 
assembled, and scaffolds were FISH-mapped to individual chromosomes where 
possible (Supplementary Table 2). The output of several gene prediction algo- 
rithms, trained using 409 manually curated gene structures, were integrated into 
a single set of gene predictions (version 4), which were used for subsequent 
analyses. Data were accessed from GeneDB (http://www.genedb.org), and 
Artemis was used for manual annotation and curation of a further 958 genes 
during subsequent analyses (as described previously**). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Genome sequencing, assembly and mapping. The most commonly used Puerto 
Rican strain of S. mansoni* was maintained in albino B. glabrata snail and NMRI 
mice and golden hamsters as laboratory hosts (Mesocricetus auratus). Cercariae 
released from infected snails were resuspended in PBS at a concentration of 
5 X 10° cercariae ml !. The parasites were transferred to a 42 °C water bath, 
incubated for 5 min, and mixed with an equal volume of 1.2% low-melting point 
agarose (Gibco-BRL) in PBS at 42 °C. The agarose/cell mixture was transferred to 
a disposable plug mould (Bio-Rad), placed on ice, treated twice for 24h at 50 °C 
with 1% N-lauroyl sarcosine, 0.5M EDTA, pH8.0, 2mgml | proteinase K 
(Boehringer Mannheim). Proteinase K was then inhibited by a 30-min treatment 
with PMSF (401g ml~'), followed by three successive 30-min dialyses against 
10 mM Tris-HCl, pH 8.0, 0.1 mM EDTA. Sequencing libraries were constructed 
using genomic DNA extracted from mixed-sex cercariae. Sequencing reads were 
produced from small insert plasmid clones containing a range of insert sizes. In 
addition, 12,305 BAC end sequences from S. mansoni BAC library Sm1 (DDBJ/ 
EMBL/GenBank accession numbers BH199420—BH211620), 19,136 CHORI 103 
BAC end sequences (DDBJ/EMBL/GenBank accession numbers DX983724— 
ED003998) and 16,628 fosmid end reads were included. After filtering out low 
quality reads, 85% of the remaining 3.19 million reads were assembled using 
Phusion” into 381 Mb, or 363 Mb after filtering small (<2 kb) contigs (see 
Supplementary Table 1). The size is greater than the previously estimated size 
of 270 Mb*’, although this size estimate can be revised to 300 Mb because the 
original measurements were made using the E. coli genome as a control, which 
has a length that is 10% greater than previously thought. From the assembly, a 
depth of coverage of ~sixfold was calculated. 

A physical map was generated using FISH to localize S. mansoni BACs to the 
seven autosomal and sex pairs of chromosomes using previously published 
methods”. Clones from two BAC libraries, Sm1 (ref. 50) and CHORI-103 
(http://bacpac.chori.org/schis103.htm), each constructed from cercarial DNA 
were randomly picked and subjected to FISH analysis. Owing to the repetitive 
nature of the schistosome genome, BACs would often hybridize to more than 
one chromosome. This was in spite of using sheared genomic DNA to block the 
repetitive sequences. Of the 500 clones analysed, 334 showed unique hybridiza- 
tion patterns (Fig. 1, Supplementary Fig. 1 and Supplementary Table 2). A total 
of 118 BACs that were FISH-mapped were among those end-sequenced, and 153 
scaffolds were assigned to a specific chromosome. 

Retroelements analysis. We performed an iterative search of retroelements 
using the conserved reverse transcriptase domain as previously described”. 
Elements with higher than 80% nucleotide identity in the reverse-transcriptase 
region were considered as members of the same family. To obtain an unbiased 
estimate of abundance for each element in the genome, all the identified families 
were mapped to the shotgun reads using BLASTN*. The number of bases 
spanned by the alignment for each element was counted and compared with 
the total number of bases in the shotgun data to determine their representation 
in the S. mansoni genome. 

Genome annotation and repeat content analysis. A training set (for ab initio 
gene finding) of 409 genes was manually curated from S. mansoni sequences 
already within the Uniprot database and manual prediction of highly conserved 
genes. Further genome-wide gene predictions were made using both 
EVidenceModeller and PASA”’. EVidenceModeller uses an evidence-combining 
strategy to compute an optimal set of protein-coding gene structures derived 
from several, often conflicting, sources of gene predictions. The sources of evid- 
ence for our annotation of the S. mansoni genome included the following: ab 
initio gene predictions derived from GlimmerHMM™, TWINSCAN”, and 
Augustus”; protein sequence homologies to a non-redundant protein database 
using AAT’; cross-genome sequence homologies between S$. mansoni and 
S. japonicum using PROmer”; spliced genome alignments to ESTs using 
GMAP”; and repeat regions identified using RepeatScout® and RepeatMasker 
(A. F. A. Smit, R. Hubley and P. Green, unpublished observations, http:// 
www.repeatmasker.org). Consensus gene predictions generated by 
EVidenceModeller were further modified to include annotations of untranslated 
regions and alternative splicing isoforms for 1,038 genes by applying PASA 
leveraging the earlier GMAP aligned ESTs. A total of 13,197 transcripts were 
predicted for 11,809 genes. Of the 30,110 previously described EST clusters, 
24,373 map to contigs >1 kb in the current genome assembly. The true number 
of genes could therefore be as high as 17,500. By parsing BLAST description lines, 
putative products were assigned to each gene. During the course of subsequent 
analyses, 958 of these were manually edited using the Artemis annotation tool. 

For an unusually large gene, encoding a putative ryanodine receptor spanning 
~164kb, 79 of its 93 intron—exon boundaries were confirmed by RT-PCR. 
Approximately 45% of the S. mansoni genome was found to be repetitive, com- 
puted by summing up genomic bases matching known S. mansoni mobile 
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element sequences or repeat family consensus sequences derived from the 
RepeatScout de novo repeat library. The repeat content was also assessed on 
the basis of the distribution of random sequences 25 nucleotides in length, 
104,028,213 out of 373,600,457 or 28% of bases were repetitive. Note, this value 
is significantly lower than that of RepeatMasker because the latter allows 
sequence divergence of up to 20%. 

Analysis of putative transcription factors. Profile hidden Markov models 
(HMMs) of domains present in the proteins that constitute the TRANSFAC 
eukaryotic transcriptional factor database®! and the DBD DNA-binding domain 
database” were used to search the genome of S. mansoni in conjunction with 63 
other eukaryotic genomes. The score threshold was defined as the lowest pairwise 
score among all members of the Pfam family associated to the HMM. The 
putative transcriptional activator proteins were then clustered on the basis of 
sequence similarities (BLASTP Evalue = 10° considered significant) using the 
TRIBE-MCL algorithm® and an inflation value of 2.0 (ref. 64). 

Micro-exon genes. MEGs were predicted as previously described’ with further 
manual refinement using available S. mansoni EST data (including both published 
data® and unpublished data from GenBank/dbEST or ftp://ftp.sanger.ac.uk/pub/ 
pathogens/Schistosoma/mansoni/ESTs/). Further family members were iden- 
tified by similarity searches against the available supercontigs in the assembly 
with long flanking MEG exons as query sequences. Signal peptides and transmem- 
brane domains were detected using SignalP® and TMHMM 2.0 (ref. 67) pro- 
grams, respectively. 

Expression of MEG families at different stages throughout the life cycle was 

analysed by BLAST searching the sequences of all members ofa family against the 
complete S. mansoni EST data set, which comprises ESTs from the following 
developmental stages: germball (28,497), cercaria (21,639), 3-day somule 
(6,122), 7-day somule (41,043), 21-day liver worm (6,044), 28-day liver worm 
(11,227), 45-day adult worm (59,552), egg (33,674) and miracidium (19,982). 
Evolutionary analysis. To identify orthologues and paralogues of S. mansoni 
genes, we built a standalone version of the TreeFam database (version 7) of 
animal gene families’. For each S. mansoni predicted protein, we identified 
the top-matching TreeFam ‘clean’ family using HMMER” (with EX 10 '° asa 
cutoff). Similarly, the top-matching family was identified for each Nematostella 
vectensis (release version 1.0)’! and S. japonicum protein. Trees and alignments 
were built for the families as for the standard TreeFam pipeline. This resulted in 
trees for 5,829 families that contain S. mansoni, S. japonicum or N. vectensis 
genes. From these trees, we identified within-species paralogues in the three 
species, and identified the ancestral taxon in which the duplication that gave 
rise to each pair of paralogues occurred. 
Kinome. A eukaryotic protein kinase domain HMM was built from a manually 
adjusted alignment of 68 diverse kinase domain sequences from yeast, worm, fly 
and human that share <50% sequence identity in the catalytic domain. To test the 
selectivity of the model, it was run against the Uniprot database. Using a P< 0.1 
cutoff, the model detected 2,688 putative domains, all of which were annotated 
either as kinases or putative kinases in different description fields. Local and global 
HMM models were built with the HMMer package (http://hmmer.janelia.org/) 
from several sequence alignments generated by MAFFT software” and were used 
for sensitive searches against the S. mansoni database. 

Identified genes were annotated using Artemis, integrating data from 
Interproscan and Reverse PSI-BLAST searches” and the size of the S. mansoni 
kinome was compared with those of: Plasmodium falciparum’*, Homo sapiens”, 
Trypanosoma cruzi’, Trypanosoma brucei’®, C. elegans’, Leishmania major’* and 
Mus musculus”. A dendogram was constructed using the kinase domains of the 
identified proteins with the CLC Main Workbench (CLC bio) using the neigh- 
bour-joining method with 1,000 replicates. 

Identification of putative proteases and inhibitors. We used the MEROPS 
database*’ (http://merops.sanger.ac.uk) to identify active S. mansoni proteases 
and protease inhibitor homologues, using BLASTP**” with E< 10 ° asa cutoff. 
More distant relatives were identified through HMMER version 2.3.2 (ref. 70) 
searches of Pfam models"! that corresond to MEROPS families (Pfam version 22.0 
(ref. 82), http://pfam.sanger.ac.uk/), using the same E-value cutoff. This initial 
data set contained 656 provisional homologues, having removed predicted 
proteases <80 residues in length as well as provisional inhibitors <50 residues 
long. A secondary screen against the NCBI non-redundant protein database 
retained a total of 369 S. mansoni sequences, which overlapped in at least 50% 
of their MEROPS hit or Pfam domain with an experimentally characterized 
protease or inhibitor homologue. False positives were removed by comparing 
nonspecific MEROPS description lines (for example, ‘non-peptidase homolo- 
gues’) to the top non-redundant BLAST hits with an E-value at least 3 logs greater 
than the top MEROPS or Pfam hit but lacking associated experimental validation. 
This approach removed MEROPS proteins that are not functional proteases but 
are structurally related (such as hormone-sensitive lipases in the family S9; flagged 
as homologues of proteins that are inactive protease homologues in 
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Supplementary Table 18). Similarly, the Pfam database models domains found 
not only in proteases and inhibitors but also in a wide range of other proteins (for 
example, PF00047, PF00059, PF00561, PF01476 and PF0764) were also removed 
as false positives in the absence of further evidence. 

We next predicted which of the putative protease homologues were likely to be 
active or inactive. BLAST alignments of proteins against putative homologues 
classified in MEROPS predicted active site positions and residues in the 
S. mansoni query sequence, followed by manual inspection of sequence align- 
ments to refine active site residue predictions. In a few cases, in which an accept- 
able alignment was not produced by BLASTP of MEROPS, a non-redundant 
sequence was used. In more difficult cases, involving two closely related 
S. mansoni sequences, active site residues were identified from multiple align- 
ments of S. mansoni sequences, a representative sequence for the corresponding 
MEROPS family, and the seed alignment sequences for the relevant Pfam model. 
Metabolic chokepoint analysis. An S. mansoni metabolic pathways database, 
SchistoCyc (http://schistocyc.schistodb.net/ptools), was created using the 
Pathway Tools software®’, which contains algorithms to predict an organism’s 
metabolic pathways from its genome by comparison to MetaCyc, a reference 
pathways database*’. From the pathway database, potential chokepoint reac- 
tions® were identified (those that uniquely consume a specific substrate or 
produce a specific product). Chokepoint reactions are probably critical to nor- 
mal cellular physiology and therefore represent potential drug targets. 
Chemogenomics. To identify, in S. mansoni, putative proteins for which thera- 
peutic compound or high quality chemical tools may already be available, 
sequence similarity searching was performed using BLASTP against the Target 
Dictionary from Drugstore!’ (database of Food and Drug Administration 
approved drugs) and StARlite (a database of Structure Activity Relationship data 
abstracted and indexed manually from the primary literature and at present 
containing 440,055 unique compounds, directed against approximately 3,500 
distinct molecular targets from the primary literature). The results were strin- 
gently filtered for significance: =50% identity, =80% overlap of the target and a 
BLAST E= 10°. To prioritise 755 hits to StARlite, we applied filters for 
potency/affinity against the matched target, combined with an estimate of the 
likelihood that the compound could be orally absorbed. The potency cutoff was 
set at a half-maximal inhibitory concentration (ICs), inhibition constant (Kj), 
or dissociation constant (Ky) of 100nM or better, and oral bioavailability was 
estimated using the ‘rule of five’ (molecular weight of no more than 500 Da, 
clogP less than five, no more than five hydrogen bond donors and no more than 
ten nitrogen or oxygen atoms)**. The drugs associated with matches in the 
DrugStore database were classified according to a broad range of current thera- 
peutic categories: (1) direct and clear evidence that this interaction is primarily 
responsible for the therapeutic action of the drug; (2) direct and clear evidence 
that this interaction is one mechanism for the drug but other targets or mechan- 
isms may exist; and (3) indirect or inferred evidence of the association of the 
drug, target and therapeutic action. 
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The active form of DNA polymerase V is 
UmuD’,C-RecA-ATP 


Qingfei Jiang', Kiyonobu Karata”, Roger Woodgate’, Michael M. Cox® & Myron F. Goodman’ 


DNA-damage-induced SOS mutations arise when Escherichia coli DNA polymerase (pol) V, activated by a RecA 
nucleoprotein filament (RecA*), catalyses translesion DNA synthesis. Here we address two longstanding enigmatic aspects 
of SOS mutagenesis, the molecular composition of mutagenically active pol V and the role of RecA*. We show that RecA* 
transfers a single RecA-ATP stoichiometrically from its DNA 3’-end to free pol V (UmuD’,C) to form an active mutasome 
(pol V Mut) with the composition UmuD’2C-RecA-ATP. Pol V Mut catalyses TLS in the absence of RecA* and deactivates 
rapidly upon dissociation from DNA. Deactivation occurs more slowly in the absence of DNA synthesis, while retaining 
RecA-ATP in the complex. Reactivation of pol V Mut is triggered by replacement of RecA-ATP from RecA*. Thus, the 
principal role of RecA* in SOS mutagenesis is to transfer RecA-ATP to pol V, and thus generate active mutasomal complex 


for translesion synthesis. 


Pol V is a low-fidelity DNA polymerase'’ induced as part of the SOS 
regulon in E. coli in response to DNA damage’. The replicative poly- 
merase, pol III, typically stalls when it encounters a DNA template 
lesion, arresting movement of the replication fork. One pathway that 
enables restoration of fork movement involves pol V, which replaces 
pol III on the sliding B-clamp*® and catalyses translesion DNA syn- 
thesis (TLS). Pol V copies numerous types of lesions’, but in a muta- 
genic manner*”. After TLS, pol III resumes normal replication. 

The pol V complex consists of UmuD’,C'®"’. Both in vitro’*'*° 
and in vivo'*'®, pol V activity requires the assembly of an active RecA 
filament on single-stranded (ss) DNA, termed RecA*!’. The bio- 
logical functions of RecA* in strand exchange during homologous 
recombination and in mediating cleavage of the repressor protein 
LexA and UmuD during the SOS response are well understood’’. In 
contrast, the biochemical role of RecA* in pol-V-dependent muta- 
genic TLS remains poorly characterized. 

Proposals for the role of RecA* in TLS have evolved from position- 
ing UmuD’,C on primer/template (p/t) DNA proximal to a 
lesion'**°, to a dynamic interaction involving displacement of 
RecA* filaments on the template by an advancing pol V*, to a model 
in which RecA* need not be located in cis on the template strand 
being copied, but can instead assemble on a separate ssDNA strand to 
transactivate pol V for TLS*!. What had not been contemplated in 
previous models, and what previous experimental designs were 
unable to detect, is whether a RecA* filament might function remo- 
tely from the site of translesion synthesis. In this unexplored scenario, 
RecA* would transfer a RecA monomer and ATP from the 3’-tip of 
RecA* to free UmuD',C, thereby converting pol V (UmuD’,C) to an 
activated form that can be used elsewhere for TLS. 


Formation of activated pol V Mut 

In earlier studies, either free RecA protein or RecA filament was 
added to pol V and p/t DNA substrate. Here, we first form RecA* 
by incubating RecA with biotinylated ssDNA bound to a streptavidin- 
agarose resin matrix in the presence of ATPyS (adenosine 5’[y-thio] 
triphosphate; see Methods). Pol V (UmuD',C) is then incubated with 


RecA* in the absence of p/t DNA, forming an activated pol V species 
that can be isolated, pol V Mut (Supplementary Figs 1 and 2). Only 
once RecA* is removed by centrifugation is p/t DNA added, allowing 
DNA synthesis in the absence of RecA* (Fig. 1a, b). In these studies, 
the p/t template is a hairpin with a 3-nucleotide overhang” instead of, 
for example, an oligonucleotide annealed to a ssDNA circle; this 
prevents formation of activated RecA* on the p/t DNA itself because 
the hairpin lacks free ssDNA on which RecA* can assemble (the 
3-nucleotide overhang is too short to permit this). Throughout, we 
use the properties of two key RecA mutant proteins, highly active 
RecA E38KAC17 (also known as RecA730AC17; ref. 22), and muta- 
genically inactive RecA F117S (also known as RecA1730; refs 15, 23). 

Removal of the RecA*-resin by centrifugation allows activated 
pol V Mut to remain in solution (Supplementary Fig. 2). Filaments 
containing RecA E38KAC17 activate pol V with approximately two- 
fold higher efficiency (Fig. 1a) than wild-type RecA* (Fig. 1b). Pol V 
avidly copies p/t DNA when RecA* is present in trans, as reported 
previously*' (Fig. 1c, d). Thus, exposure of UmuD',C to RecA* is 
required for pol V activity. Most importantly, pol V remains highly 
active after removal of RecA* from the reaction (Fig. 1a, b). 

To investigate how long pol V Mut remains active in the absence of 
RecA*, we incubated the enzyme in standard reaction buffer at 37 °C 
for various times before adding the p/t DNA and dNTP substrates to 
initiate DNA synthesis. Polymerase activity decreases as the delay 
time increases (Fig. le, f). The decrease in activity is approximately 
exponential, and is similar for pol VMut activated with either 
RecA E38KAC17* or wild-type RecA* (Fig. 1g). The loss in activity 
is reversible; addition of RecA* in trans at each time point restores 
pol V Mut to a fully activated state (Fig. 1g). ATP hydrolysis has no 
role in pol V activation or in the activity decay, because synthesis by 
pol V Mut and the loss of activity during the time delay is identical 
when RecA* is assembled with ATP, ATPyS or AMP-PNP (f-y- 
imidoadenosine 5’-triphosphate; Supplementary Figs 3 and 4). 

Each active pol VMut complex can promote only one round of 
DNA synthesis (Fig. 1h, i). A plot of primer use against time shows 
that, after the addition of one or more nucleotides, pol V Mut cannot 
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Figure 1| DNA synthesis by pol V Mut or pol V transactivated by RecA*. 
The p/t DNA is a hairpin containing a 3-nucleotide template overhang. 

a, b, DNA synthesis by pol V Mut—(RecA E38KACI17) (a) or 

pol V Mut—(wild-type RecA) (b) in the absence of RecA*.¢, d, DNA synthesis 
by pol V transactivated by RecA E38KAC17* (c) or wild-type RecA* 

(d). e, f, DNA synthesis by pol V Mut—(RecA E38KAC17) (e) or 

pol V Mut—(wild-type RecA) (f) undergoing deactivation as a function of 
time. g, Deactivation of pol V Mut—(RecA E38KACI17) and pol V Mut—(wild- 
type RecA) measured by quantifying DNA synthesis obtained from data in 
panels e and f, respectively. Complete reactivation of deactivated pol V Mut 


reinitiate synthesis on a different p/t DNA (Fig. 1h, i). Deactivated 
pol V Mut is reactivated by adding RecA* in trans (Fig. 1i), but not 
by adding either RecA (Fig. 1i) or ssDNA alone (data not shown). After 
synthesis, deactivated polV Mut dissociates into solution (Sup- 
plementary Fig. 5). Pol V Mut catalyses TLS in the absence of RecA*, 
but without pre-activation, pol V is inactive even on undamaged DNA 
(Supplementary Fig. 6). 


Pol V Mut consists of UmuD’,C-RecA-ATP 

Pol V Mut activated with RecA E38KAC17* synthesizes DNA equally 
well with or without a nucleotide cofactor, ATP or ATPyS, added 
post-activation (Fig. 2a). In marked contrast, pol V Mut activated 
with wild-type RecA* is inactive in the absence of added ATP or 
ATPYS (Fig. 2b). These properties indicate that activation entails 
formation of a pol V-RecA complex. A denaturing gel of activated 
pol V Mut provides visual evidence that UmuD’,C forms a complex 
with RecA (Fig. 2c). A plot of RecA released from RecA* as a function 
of increasing polV concentration shows a I:1:1 ratio of 
RecA:UmuC:UmuD’, over a tenfold range of pol V Mut (Fig. 2d). 
When resin-bound RecA* is pre-incubated in the absence of pol V 
and centrifuged, an insignificant amount of RecA is released from the 
resin into the supernatant (Fig. 2c, lane 1) compared with the amount 
of RecA bound to pol V Mut (Fig. 2c, lanes 2-5). Quantification of 
ATPYS released into the supernatant also indicates that a RecA-ATP 
complex is transferred from RecA* to pol V to form pol V Mut, in a 
1:1:1 stoichiometry of UmuD’,C-—RecA-ATP (Fig. 2c, d). 

To obtain direct evidence that RecA is part of pol V Mut, we 
formed pol V Mut using fluorescein-labelled (FI-) wild-type RecA 
(Fig. 2e); its behaviour is indistinguishable from pol V Mut activated 
with unlabelled wild-type RecA (Supplementary Fig. 7). We iden- 
tified the components of the activated and deactivated pol V Mut 
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is observed by addition of trans RecA* at each delay time. h, pol V Mut 
performs one round of DNA synthesis and cannot reinitiate synthesis on 
separate p/t DNA substrate. The concentration of p/t DNA is 1 uM. We 
estimate that the fraction of active polymerase in the reaction is 50% for 
pol V Mut—(RecA E38KACI17) and 25% for pol V Mut—(wild-type RecA). 

i, Addition of trans RecA E38KAC17* enables pol V Mut—(RecA E38KAC17) 
to reinitiate DNA synthesis on a separate p/t DNA substrate. Cycling also 
occurs using pol V Mut—(wild-type RecA) in conjunction with trans wild- 
type RecA* (data not shown). Trans RecA* is at 1 1M when present. 


species by affinity chromatography of pol V Mut—(Fl-wild-type 
RecA) either immediately after activation or after delay times of 
1 or 2h. After elution from the column, Fl-wild-type RecA and 
[y-*°S]ATP are retained in a complex with UmuC and UmuD’ irre- 
spective of delay time (Fig. 2e). Thus, deactivation of pol V Mut is 
neither caused nor accompanied by a concomitant loss of RecA or 
ATPYS. We verified the composition of reactivated pol V Mut by 
incubating pol V-(Fl-wild-type RecA) complexes isolated after a 
2h delay time with increasing amounts of unlabelled wild-type 
RecA* (Fig. 2f). After adsorption and elution of pol V-RecA from 
the column, the Fl-wild-type RecA and [y-*°S]ATP are readily 
replaced with unlabelled wild-type RecA and ATPYS, with greater 
replacement occurring as the concentration of unlabelled wild-type 
RecA* is increased (Fig. 2f). These data confirm that UmuD’,C, 
RecA and ATP are present in 1:1:1 stoichiometry (Fig. 2d). 

We used laser multi-angle light scattering (MALS)™ to evaluate 
the molecular mass of pol V Mut—(wild-type RecA). Pol V alone was 
activated by incubating UmuD',C with trans wild-type RecA* bound 
to resin. After centrifugation, the supernatant containing a mixture 
of pol V Mut—(wild-type RecA) and non-activated pol V was resolved 
by size-exclusion chromatography. The column effluent was passed 
directly on-line into a MALS detection system. Two light scattering 
peaks were observed (Fig. 3a, upper trace). The peak eluting at 
17.3 min indicates a molecular mass of 113 kDa, in accord with the 
value predicted for UmuD’,C-(wild-type RecA) (110kDa). The 
peak eluting at 18.4 min indicates a molecular mass of 73 kDa, in 
accord with UmuD’,C (72 kDa), and coinciding with the single peak 
exhibited by non-activated pol V (UmuD’,C; Fig. 3a, lower trace). 
Analysis of the two peaks shows that UmuC, UmuD’ and wild-type 
RecA are present in the 113 kDa peak, whereas only UmuC and 
UmuD’ are in the 73kDa peak (Fig. 3b). Thus, the MALS data 
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Figure 2 | Activated pol V Mut complex is composed of 
UmuD’,C-RecA-ATP. a, b, DNA synthesis by pol V Mut—(wild-type RecA) 
(b), but not pol V Mut—(RecA E38KAC17) (a), requires the presence of 
ATPYS or ATP. ¢, d, Activated pol V Mut—(wild-type RecA) complex 
contains UmuC, UmuD’ and wild-type RecA, in a 1:1:1 ratio. e, Fl-wild-type 
RecA and ATPyS remain bound in deactivated pol V Mut—(Fl-wild-type 
RecA) complexes as measured by affinity chromatography. f, Reactivation of 
pol V Mut—(Fl-wild-type RecA) is achieved by replacing Fl-wild-type RecA 
and [*°S]ATPyS bound to deactivated pol V Mut-(Fl-wild-type RecA) with 
unlabelled wild-type RecA and ATPyS from wild-type RecA*. Reactivated 
pol V Mut—(Fl-wild-type RecA) complexes were isolated as in panel e. 


provide independent verification that the intact pol VMut—(wild- 
type RecA) complex contains UmuD’,C bound to wild-type RecA, 
in accord with the affinity column binding and elution analysis of 
pol V Mut—(wild-type RecA) (Fig. 2e, f). 


Activation requires transfer from the 3’-proximal tip of RecA* 
We have previously shown that pol V is activated only when it 
encounters a RecA molecule located at the 3’-proximal tip of 
RecA**', We determined whether RecA is transferred to pol V from 
one or both RecA* filament ends by incubating pol V with RecA* 
having either the 3’- or 5’-proximal tip exposed (Supplementary 
Fig. 8a). After removing RecA* by centrifugation, the supernatant 
containing pol V Mut was separated by SDS—polyacrylamide gel elec- 
trophoresis (Fig. 4a). The data show that one molecule each of RecA 
and ATPYS is transferred to pol V from both ends of RecA* (Fig. 4a 
and Supplementary Fig. 8b). However, pol V becomes activated only 
if wild-type RecA or RecA E38KACI17 is transferred from the 
3'-proximal tip of RecA* (Fig. 4b). RecA F117S is also transferred to 
polV from either the 3’- or 5’-tip, but fails to activate pol V in both 
cases (Fig. 4a, b). 


Discussion 


We propose a comprehensive biochemical model depicting pol V 
activation, deactivation and reactivation (Fig. 5). Pol V (UmuD’,C) 
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Figure 3 | Determination of the molecular mass of pol V Mut-(wild-type 
RecA) by MALS. a, After wild-type-RecA*-mediated transactivation of 
UmuD’,C and removal of wild-type RecA*, the mixture of pol V Mut—(wild- 
type RecA) and non-activated pol V was resolved by size-exclusion 
chromatography (upper trace), and the molecular mass corresponding to 
each peak was measured by MALS. Non-activated pol V run separately on 
the silica gel elutes at 18.4 min (lower trace). b, Silver-stained 
SDS-polyacrylamide gel showing the protein composition from the two 
peaks contained in panel a (upper trace). 


first migrates to a preformed RecA* filament. It is activated by the 
transfer of a 3’-RecA-ATP subunit from RecA* to polV to form 
activated pol V Mut, that is: UmuD’,C-—RecA-ATP. Pol V Mut scans 
the genome for sites where replication has stalled. When a suitable 
primer/template is located, pol V Mut binds and promotes DNA syn- 
thesis in the absence of RecA*. Pol V Mut is inactivated by one round 
of primer extension. Subsequent RecA*-mediated reactivation of 
pol V Mut occurs when the complex again encounters a RecA* fila- 
ment, and replaces the ‘used’ RecA—ATP, which is still bound, but in 
an inactive conformation, with ‘new RecA-ATP (Fig. 5). The model 
has the unique feature that there is no need for an active RecA* 
filament to be present during TLS; also, although ATP is required, 
ATP hydrolysis is not. 

Pol-V-mediated TLS requires the presence of RecA* both in 
vivo'*'® and in vitro’”'*'’. Since the late 1980s the most important 
but refractory questions have been ‘what is the molecular composi- 
tion of the pol V mutasome?’ and ‘what is RecA* doing?’. The 
requirement for RecA* initially suggested that a RecA filament would 
form in cis*'?, However, RecA filaments formed in cis strongly 
impede pol-V-mediated DNA synthesis‘. Activation in trans resolves 
the difficult situation of pol V replication of DNA with RecA bound 
to it*’. However, transactivation introduces new structural and topo- 
logical problems associated with a RecA filament bound to DNA in 
one genomic location interacting continuously with a pol V replic- 
ating ina DNA gap somewhere else. And finally, how is pol V inacti- 
vated when the SOS response is no longer needed? 

The model in Fig. 5 answers these questions. The role of RecA* is 
both to promote the autocatalytic cleavage of UmuD to UmuD’, and 
to activate the resulting UmuD’,C complex by transferring a single 
RecA subunit to pol V from its 3’-proximal tip. Activated pol V Mut 
can migrate to wherever it is needed; no RecA* filament needs be near 
the site where TLS actually takes place. In fact, no RecA nucleoprotein 
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Figure 4 | Pol V activation requires transfer of a RecA molecule from the 
DNA 3’ end of RecA*. a, The amount of RecA transferred to pol V from 
exposed 3’- or 5’-tips of RecA* are similar for wild-type RecA*, 

RecA E38KAC17* and RecA F117S*. b, DNA synthesis catalysed by 

pol V Mut—(wild-type RecA) or pol V Mut—(RecA E38KAC17) occurs when 
RecA is transferred from the 3’-RecA* tip (left gel). Pol V Mut cannot 
synthesize DNA when RecA transfer takes place from the 5'-RecA* tip (right 
gel). Mutagenically inactive RecA F117S* fails to activate pol V. 


filament needs to participate directly in TLS, either in cis or in trans, a 
fact obscured in previous studies because such filaments were always 
present when TLS was observed. Participation in TLS as a subunit of 
pol V Mut becomes the only filament-independent role documented 
for RecA. Rapid inactivation of pol V Mut after TLS, or slower in- 
activation if no suitable p/t is encountered, ensures that pol-V- 
catalysed error-prone DNA synthesis will cease soon after the 
RecA* filaments supporting the SOS response (and pol V reactiva- 
tion) are gone. The mutational load associated with pol V function is 
thereby minimized, effectively restricting it to periods when SOS is 
induced. 


UmuC 
pol V RR @ RecA-ATP 


replacement 


ng 
RecA* 


_* 3 


RecA* 


pol V mt RR ) FP Deactivated 
Deactivation (slow) poly Mut 


pol V Mut-DNA | Deactivation 
pol V Mut-DNA dissociation (rapid) 


i aa 


Figure 5 | Model depicting pol V activation, deactivation and reactivation. 
Pol V (UmuD’,C) is barely active in the absence of RecA*. A molecule of 
RecA (red circle) and ATP (green triangle) is transferred from the DNA 3’- 
end of RecA* to form a mutasomal complex (pol V Mut) containing 
UmuD’,C-RecA-ATP; this catalyses TLS in the absence of RecA*. After 
TLS, pol V Mut undergoes rapid deactivation upon dissociation from DNA, 
with RecA and ATP retained in the complex. When pol V Mut is activated, 
we surmise that RecA—ATP is bound to UmuD’ ), and that dissociation from 
p/t DNA triggers a repositioning of RecA—ATP to bind with UmuC to 
deactivate pol V Mut. Reactivation of pol V Mut involves a replacement of 
RecA-ATP from the RecA* 3'-tip. Free pol V Mut undergoes slow 
deactivation, with RecA and ATP retained in the complex. A damaged DNA 
base is shown as a black cross in the template strand. 


B® pol V Mut 
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We have shown that transfer from the 3’-tip of RecA* is an abso- 
lute requirement for pol V activation (Fig. 4b). Consistent with in 
vivo data”’, the mutagenically inactive RecA F117S is unable to activ- 
ate pol V for DNA synthesis (Fig. 4b, left gel), despite the transfer of 
RecA F117S from the 3'-tip of RecA F117S* (Fig. 4a, left gel). 
Notably, the mutation in RecA F117S is located at the surface facing 
3' ona RecA filament end”°. 

In the model (Fig. 5), we show RecA bound to UmuD’, when 
pol V Mut is either activated or reactivated, and to UmuC when 
pol V Mut is deactivated. This assignment is based on our previous 
identification of two distinct modes of RecA binding to pol V**. RecA 
and polV form a stably bound complex (dissociation constant 
Ky 250nM) through the UmuC subunit in the absence of DNA; 
we have called this ‘mode 1’ binding’. RecA also binds to pol V, with 
a similar apparent Kg, through its UmuD’, subunit, requiring DNA 
and ATP, but not ATP hydrolysis, referred to as ‘mode 2’ binding”. 
Therefore, we speculate that RecA binds proximally to UmuD’, during 
DNA synthesis, and that dissociation from p/t DNA triggers a rapid 
conformational rearrangement, repositioning RecA to bind with 
UmuC. When a newly activated pol V Mut is free in solution, the 
presumed conformational rearrangement shifting RecA binding from 
UmuD’, to UmuC occurs much more slowly (Fig. 1g). Subsequent 
reactivation of polV Mut (Fig. le-g) could occur by a RecA*- 
mediated replacement of RecA-ATP with the deactivated form of 
pol V Mut (Fig. 2f). 

A decade before the discovery of pol V, the term “mutasome” was 
coined by Echols to define the protein components that assemble at a 
lesion in the presence of a stalled replication complex to mediate 
TLS”. We now propose that the mutasome envisioned by Echols is 
a ‘stand alone’ activated multiprotein complex, pol V Mut, composed 
of UmuD',C—RecA—ATP. 


METHODS SUMMARY 


PolV was activated by incubation with RecA* immobilized on streptavidin- 
coated resin. RecA* was removed by centrifugation, leaving pol V Mut in the 
supernatant. The polymerase activity of pol V or pol V Mut was measured as the 
rate of deoxynucleotide addition to the 3'-OH end ofa hairpin p/t DNA contain- 
inga*’P label at its 5'-end. The sequences of template and biotinylated oligomers 
are shown in Supplementary Fig. 9. Gel band intensities were quantified by 
phosphorimaging with IMAGEQUANT software (Molecular Dynamics), and 
primer utilization was calculated from the integrated gel band intensities of 
extended p/t DNA divided by the total DNA intensity. Deactivated and reacti- 
vated His—pol V Mut were bound to a nickel-affinity column and washed twice 
with reaction buffer. Bound pol V Mut was eluted from the column by washing 
with buffer containing imidazole. The amount of Fl-RecA and [y-*°S] ATP 
remaining bound to pol V Mut were visualized by SDS-PAGE or by urea 
PAGE, respectively. For MALS analysis, pol V Mut was subjected to size-exclu- 
sion chromatography using a silica-based column (KW-802.5, Shodex). 
Chromatography was performed at 0.5 ml min! and the column effluent was 
passed directly on-line into a MALS detection system (Dawn Heleos, Wyatt 
Technology) to determine the molecular masses of eluted species; fractions were 
collected manually after passing through the light scattering detector. Data ana- 
lysis was performed with ASTRA software. 
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Contamination of the asteroid belt by primordial 


trans-Neptunian objects 


Harold F. Levison’”, William F. Bottke’”, Matthieu Gounelle®, Alessandro Morbidelli*, David Nesvorny’” 


& Kleomenis Tsiganis” 


The main asteroid belt, which inhabits a relatively narrow annulus 
~2.1-3.3 au from the Sun, contains a surprising diversity of objects 
ranging from primitive ice-rock mixtures to igneous rocks. The 
standard model used to explain this assumes that most asteroids 
formed in situ from a primordial disk that experienced radical 
chemical changes within this zone’. Here we show that the violent 
dynamical evolution of the giant-planet orbits required by the so- 
called Nice model”~ leads to the insertion of primitive trans- 
Neptunian objects into the outer belt. This result implies that 
the observed diversity of the asteroid belt is not a direct reflection 
of the intrinsic compositional variation of the proto-planetary 
disk. The dark captured bodies, composed of organic-rich materi- 
als, would have been more susceptible to collisional evolution than 
typical main-belt asteroids. Their weak nature makes them a pro- 
digious source of micrometeorites—sufficient to explain why 
most are primitive in composition and are isotopically different 
from most macroscopic meteorites”®. 

Our result that a significant fraction of objects in the main asteroid 
belt are captured primordial trans-Neptunian objects (hereafter 
termed ‘comets’ or ‘comet-like’ to reflect their primitive, icy natures) 
comes from a series of numerical calculations of the trajectories of 
small bodies during dynamical events proposed in the Nice model”. 
In this model, the giant planets are assumed to have formed in a 
compact configuration (all were located between 5 and 15 Au from 
the sun) and to be surrounded by a ~35Mga disk of comet-like 
planetesimals stretching between ~16 and ~30au from the sun 
(Mg denoting the mass of the Earth). After roughly 600 Myr, the 
orbits of the giant planets became unstable. Uranus and Neptune 
were gravitationally scattered outwards, thereby penetrating the disk 
and scattering its constituents throughout the Solar System. The 
planets evolved into their current orbits as a result of the gravitational 
effects of the ‘comets’. This model is unique because it can quantita- 
tively explain many of the observed characteristics of the Solar 
System (for example giant-planet orbits*, Trojan asteroids’, the 
Kuiper belt®, the irregular satellites’ and the origin of the so-called 
late heavy bombardment’). 

Our current simulations follow the dynamical evolution of objects 
that originally formed in the Nice model’s primordial comet disk as 
the giant planets’ orbits evolved. We are primarily interested in the 
fate of objects that potentially could have entered the inner Solar 
System. Thus, we first integrated the orbits of a large number of 
massless planetesimals initially on Saturn-crossing orbits under the 
gravitational influence of the Sun, Jupiter and Saturn. The planets 
were forced to migrate by including a suitably chosen acceleration in 
the planets’ equations of motion, so that they reproduced the evolu- 
tion of the ‘fast migration’ run in ref. 3 (see Supplementary 


Information, section $1, for more details). During the integration, 
we supplied a steady flux of planetesimals through the Jupiter-Saturn 
system. These objects represent the planetesimals that originally 
formed in the trans-planetary disk, but were destabilized and fed 
inwards by Uranus and Neptune. The integrations covered 10 Myr. 
In all, we followed the evolution of ~31,000,000 disk particles. We 
find that a significant number of objects are captured either in Trojan 
orbits (as in ref. 3) or in orbits inside that of Jupiter. A Jupiter-cross- 
ing object can evolve into a stable orbit in the main asteroid belt 
through a combination of dynamical processes. Initially, Jupiter will 
scatter objects and occasionally one will fall in a mean-motion res- 
onance, which can lower eccentricity. This would be temporary if 
there were no migration. With migration, however, the resonances 
move, and low-eccentricity objects can be left behind. 

Next we tracked the long-term evolution of this captured comet- 
like population to compare it with observations. This population is 
affected by two processes: dynamical erosion and collisional grind- 
ing. To address the first, we integrated the orbits of the implanted 
comets under the gravitational effects of the Sun and gas giants for 
4 Gyr. We removed an object if it either collided with a planet or 
reached a heliocentric distance greater than 15 AU or less than 1.5 AU. 
Figure 1 shows a comparison between the surviving objects (red dots) 
and a complete sample of known large asteroids (green pluses). We 
find that a substantial population of Trojans (32 of the 230 captured 
objects, or 13%) and Hilda asteroids (8%) are produced. 

In addition, we captured comets in orbits in the main asteroid belt 
with semi-major axes as low as 2.68 AU. To compare these objects 
with the rest of the main belt, we used classification based on spec- 
troscopy. According to our model, objects captured in the main belt 
should physically be similar to the resonant Trojans and Hildas. The 
vast majority of these resonant objects are classified as D- or P-type 
(hereafter D/P-type), which probably are organic-rich. Indeed, they 
are a good match to spectral characteristics of the observed dormant 
comets’®. This is consistent with the Nice model’s prediction that 
they formed beyond 15 au. 

The black dots in Fig. 1 show the D-type asteroids taken from the 
databases of refs 11, 12. There is reasonable agreement between the 
orbital element distribution of our captured objects and the known 
primitive asteroids (Trojans, Hildas and D-type main-belt objects). 
This result naturally leads to the controversial idea that most prim- 
itive asteroids formed beyond ~15 au. 

If the above model is correct, we need not only to be able to 
reproduce the orbits of the primitive, comet-like asteroids, but their 
total number and size distribution as well. To accomplish this, we 
need to account for their collisional evolution, which we do with a 
self-consistent code, CoDDEM, capable of following the collisional 
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Figure 1| The orbital element distributions of real and modelled asteroids. 
The distributions of eccentricity, e (a), and inclination, i (b), as functions of 
semi-major axis, a, of asteroids in Hilda (objects which orbit in the 2:3 mean- 
motion resonance with Jupiter near 3.9 aU), Trojan and main-belt 
populations. The small green pluses show all the numbered objects with 
diameters greater than 40 km (the asteroid belt is probably complete at these 
sizes) in the International Astronomical Union’s Minor Planet Center 
database. We calculated the diameters from published absolute magnitudes, 
assuming the albedos (Supplementary Information, section $2.1). The black 
symbols show the D-type asteroids as catalogued in refs 11 and 12. It is 
important to note that asteroid (336) Lacadiera, at a = 2.25 Au, e = 0.1 and 
i= 5.6°, which is classified as a D-type in ref. 11, has an unusual spectrum 
(H. Campins, personal communication), and thus is probably a different 
type of object. We therefore use a smaller symbol to plot the location of this 
object and did not include it in our analysis. We only include D-type 
asteroids here because it is difficult to distinguish P-types from other, more 
processed, asteroids, with the result that the catalogue of P-types probably 
suffers from significant contamination*’. The red dots show the location of 
objects captured during our simulations. The main result of our simulations 
is that, in addition to the resonant asteroids, a significant number of objects 
are trapped in the outer main belt (OMB). Unfortunately, it is not possible to 
perform a direct comparison between the orbital element distribution of our 
trapped objects and that of the D-types because the observations are biased 
owing to selection criteria, asteroid families and the like. However, we find 
noteworthy the fact that the inner edge of each populations is at a ~ 2.6 AU. 


evolution and dynamical depletion of multiple interacting size-fre- 
quency distributions (SFDs)’*"*. This code was modified to use an 
improved algorithm for handling the outcome of individual colli- 
sions, taken from ref. 15. We tracked five populations: (1) indigenous 
main-belt objects (a< 2.82 Au), (2) indigenous OMB objects, (3) 
captured OMB objects, (4) Hildas and (5) Trojans. The collision 
probabilities and impact velocities with both themselves and each 
other were computed from the observed objects or, in the case of the 
Trojans, were taken from refs 16-18. Populations 1 and 2, which 
represent native asteroids, had a bulk density of 2.7 gcm™* and were 
assumed to follow the disruption scaling law for rocky objects'*. We 
also assumed that the indigenous main-belt SFDs had approximately 
the same shape 3.9 Gyr ago as they do today’*™. 

The captured populations (3, 4 and 5) have a bulk density of 
lgcm ”. We assumed that the initial shape of their SFDs was the 
same as that of the currently observed Trojans (Fig. 2 and 
Supplementary Information, section $2.2). Because the cometary 
disruption scaling law is not well understood, we performed four 
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Figure 2 | The beginning and end states of the SFDs in the three regions 
studied. a, Trojans; b, Hildas; c, OMB objects. The blue curves show the 
initial conditions for the captured comets in our CODDEM simulations. The 
shapes of the initial SFDs for these populations were taken from the currently 
observed Trojan SFD. We chose the Trojans because these objects have been 
least affected by collisional grinding. In particular, using the cumulative 
number N(>D) « D4, we assigned values of q = 5.5 for D> 105 km and 
q = 1.8 for D< 105 km, where D is diameter. The overall scales of the SFDs 
for the captured populations are derived from a combination of the observed 
Trojans and the results of the 10-Myr migration simulations (Supplementary 
Information, section $2.2). For each value of fg, we performed ~10 
simulations using different random number generator seeds. The red curves 
show an average of these simulations and thus are our predictions for the 
present-day SFDs of the captured objects. In particular, the dotted, solid, 
dashed, and dash-dot red curves correspond to fg = 1, 3, 5 and 8, 
respectively. For the Hildas and Trojans, we expect the model SFD to match 
observations (black curves; note that the roll-off at small sizes is due to 
observational incompleteness). The agreement is quite good given the 
uncertainties in our models (Supplementary Information). Of particular 
note, the Trojans are known to have a significantly steeper SFD at 

D = 100km than the Hildas. Our model reproduces this because the Hildas, 
which cross the main belt, undergo more collisional grinding. For the main 
belt, the model also predicts that the SFD of the primitive bodies is steeper 
than that of the indigenous population for D = 20 km, so the fraction of 
D/P-types should increase as diameter decreases. This appears to match 
observations”. The solid and dotted green curves in ¢ refer to the indigenous 
OMB objects at the beginning and end of the simulation, respectively. 


series of simulations in which this parameter was varied. In particu- 
lar, following ref. 19, we assumed that the amount of energy required 
to catastrophically disrupt a comet is the same as that for solid ice”, 
divided by a factor fg (Supplementary Information, section $2.2). We 
studied fg = 1, 3, 5 and 8 (the limit found in ref. 19). The required 
dynamical depletion rates were taken from the dynamical simula- 
tions described above, whereas those of the indigenous asteroids 
came from existing calculations of asteroid-belt depletion’ in the 
Nice model (see the discussion about population 3 in Supplemen- 
tary Information, section S1). 

The results of our collisional calculations are shown in Fig. 2. Our 
model reproduces the SFDs of both the Trojan and Hilda populations 
for diameter D> 40km fairly well. This agreement is important 
because each population has unique characteristics that test our 
model’s assumptions. For example, comminution in the Trojan 
population comes mainly from members hitting one other, whereas 
that in the Hilda population is driven by impactors from across the 
asteroid belt. Matching the SFDs of these populations simultaneously 
in the same model, therefore, increases our confidence that our 
assumptions are reasonable. 

The final state of the captured cometary OMB is strongly depend- 
ent on the disruption scaling laws used. For fg = 1, the model pre- 
dicts that all of the OMB objects with D> 40 km are captured. For 
fo = 3, 5and 8 we find that captured objects represent 48%, 20% and 
16% of the OMB, respectively. This decrease is due to a much larger 
fraction of the captured objects being destroyed by impacts than their 
stronger indigenous counterparts. The best available bias-corrected 
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estimates of the fraction of D/P-types within the OMB over the same 
size range is ~20% (refs 21-23), suggesting that fg 2 5. Because the 
proportion of D/P-types is highly uncertain, this limit for fg is prob- 
ably lower. The requirement that the primordial cometary disk sur- 
vives the ~600 Myr before the late heavy bombardment requires that 
fo<5 (Supplementary Information, section $4), which is consistent 
with our upper range. We conclude, therefore, that captured comets 
are weaker than native asteroids. 

If the implanted comets are weaker than the indigenous main-belt 
asteroids, we can potentially resolve another long-standing problem 
in planetary science. The micrometeorites collected on the Earth are 
dominated by material reminiscent of carbonaceous chondrites. 
Indeed, the ratio of ordinary chondrite-type material to carbona- 
ceous chondrite-type material (hereafter O/C) in micrometeorites 
is $0.16 (ref. 24). Yet the asteroid belt, which has traditionally been 
considered the dominant source of this material”? (see 
Supplementary Information, section $3, for alternatives), is roughly 
an equal mix of ordinary chondrite-like asteroids (for example 
S-types) and carbonaceous chondrite-like asteroids (for example 
C-types). The question has thus been that of why the micrometeorites 
we collect do not reflect this ratio. 

In our successful collisional models (that is, those with fg = 3), we 
find that, at the current epoch, the implanted comets produce more 
than 15 times more dust per unit mass than the indigenous asteroids 
(the exact value depending on fg). This is because the implanted 
population is made up of weaker objects that are breaking up more 
easily. Furthermore, because the material created is mineralogically 
similar to that produced by primitive C-type asteroids, our model 
can explain the overabundance of carbonaceous chondrite microme- 
teorites. Indeed, collisional simulations of the current asteroid belt 
(Supplementary Information, section $3) show that an observed 
value of O/C < 0.16 is a common occurrence if fg = 3. 

The idea that D/P-types are both weak and the source of micro- 
meteorites also provides an explanation for the surprising fact that 
there are few micrometeorite analogues in our macroscopic meteor- 
ite collection. If we suppose that these micrometeorites are generated 
by the collisional cascade of embedded comets, we would not expect 
to see many macroscopic samples for two reasons. First, our 
implanted population resides in a region of the main belt from where 
it is very difficult to get meteorites*®’’. In particular, macroscopic 
objects leaving this region of the asteroid belt are most likely to be 
ejected from the Solar System by Jupiter, whereas microscopic 
bodies, which are more susceptible to radiation forces, are more likely 
to be delivered to the Earth. In addition, the presumed weak nature of 
the captured objects probably makes their macroscopic fragments 
unlikely to survive the passage through the Earth’s atmosphere or, 
if they do, survive for long on the Earth’s surface. 

Our conclusion that most primitive asteroids formed at large 
heliocentric distances and then were trapped in their current orbits, 
coupled with the idea that at least some asteroids were scattered into 
the main belt from the terrestrial-planet region”’, changes our view of 
the asteroid belt. The traditional interpretation of the diversity of the 
asteroid belt is that it represents the original condensation sequence 
in the protoplanetary disk’. Indeed, their orbital distribution has 
been used to constrain both the thermal structure of the nebula 
and the effectiveness of various heating mechanisms as a function 
of heliocentric distance (for example the decay of short-lived radio- 
nuclides such as *°Al)”’. If most D/P-types in the inner solar system 
were captured from farther out, however, our models of the proto- 
planetary disk would have to be significantly revised. Indeed, the 
diversity in the asteroid belt may be telling us more about the dynam- 
ical processes that controlled planet formation than about the phys- 
ical nature of the protoplanetary nebula, with the main asteroid belt a 
collection point for rogue planetesimals from across the Solar 
System. 
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Manipulation of photons at the surface of 
three-dimensional photonic crystals 
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In three-dimensional (3D) photonic crystals’, refractive-index 
variations with a periodicity comparable to the wavelength of the 
light passing through the crystal give rise to so-called photonic 
bandgaps, which are analogous to electronic bandgaps for electrons 
moving in the periodic electrostatic potential of a material’s crystal 
structure. Such 3D photonic bandgap crystals are envisioned to 
become fundamental building blocks for the control and manipula- 
tion of photons in optical circuits. So far, such schemes have been 
pursued by embedding artificial defects*>*’ and light emitters*” 
inside the crystals, making use of 3D bandgap directional effects. 
Here we show experimentally that photons can be controlled and 
manipulated even at the ‘surface’ of 3D photonic crystals, where 3D 
periodicity is terminated, establishing a new and versatile route for 
photon manipulation. By making use of an evanescent-mode coup- 
ling technique, we demonstrate that 3D photonic crystals possess 
two-dimensional surface states, and we map their band structure. 
We show that photons can be confined and propagate through these 
two-dimensional surface states, and we realize their localization at 
arbitrary surface points by designing artificial surface-defect struc- 
tures through the formation of a surface-mode gap. Surprisingly, 
the quality factors of the surface-defect mode are the largest 
reported for 3D photonic crystal nanocavities (Q up to ~9,000). 
In addition to providing a new approach for photon manipulation 
by photonic crystals, our findings are relevant for the generation 
and control of plasmon-polaritons in metals and the related surface 
photon physics. The absorption-free nature of the 3D photonic 
crystal surface may enable new sensing applications and provide 
routes for the realization of efficient light—matter interactions. 

Surface states in 3D photonic crystals were initially discussed 
theoretically with a view to preventing them from disturbing the 
optical properties of embedded artificial defects’*. If we could instead 
make use of the surface states positively to manipulate photons, it 
would open up a new route for the manipulation of photons by 3D 
photonic crystals, through which various types of access from outside 
would become straightforward compared with those formed inside 
the 3D photonic crystals. Such studies would simultaneously lead to 
the establishment of the surface science of 3D photonic crystals, and 
have great relevance to the surface plasmon-polariton effect of metals 
and the related surface photonic physics. However, experimental 
investigations of the manipulation of photons at the surface of 3D 
photonic crystals had not yet been performed. Here we demonstrate 
the presence of surface states at the surface of 3D photonic crystals, 
and show that the confinement and propagation of photons via the 
surface states is indeed possible. We also control the surface states, 
and demonstrate that the photons can be localized at desired and 
arbitrary surface positions by the formation of the surface-mode gap 
and the introduction of artificial localized surface-defect states. 

The 3D photonic crystals we investigated have a stacked-stripe (or 
woodpile) structure (Fig. 1a) with a full 3D photonic bandgap*'*”. 


The in-plane band structures of the 3D photonic crystals without and 
with the ‘surface’ are shown in Fig. 1b and c. The width and thickness 
of the individual dielectric rods were both set at 0.44, where a is 
the centre-to-centre separation of the parallel dielectric rods. It is 
apparent from Fig. 1c (see red lines) that the surface states are formed 
in the frequency region where the bandgap would otherwise exist 
(Fig. 1b). Owing to the structural anisotropy for the x-direction 
and the y-direction at the surface (see inset in Fig. la), the dispersion 
relation of the surface states are different for the [—X, direction and 
the IX, direction, respectively. The inset in Fig. lc shows an 
example of the electromagnetic-field distribution of a particular sur- 
face state at the blue point. This confirmed that the photon mode is 
indeed formed at the surface of the 3D photonic crystal. 

To demonstrate experimentally the presence of the surface states, 
we fabricated 3D photonic crystals with gallium arsenide (GaAs)- 
based eight-stacked-stripe layers*’®'’, in which a was set at 500 nm, 
and the width and thickness of the individual dielectric rods were both 
set at 200 nm (=0.4a). Figure 2a (left-hand side) shows a scanning 
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Figure 1| 3D photonic crystal and band structures. a, Schematic structure 
of the 3D photonic crystals, and representative directions in real and 
reciprocal spaces. Owing to the anisotropy of the surface structure for the 
x-direction and the y-direction, we need to distinguish the [—X, direction 
and the I —X, direction. b, ¢, In-plane band structures for the 3D photonic 
crystal without (b) and with (c) ‘surfaces’. The width and thickness of 
component dielectric rods are both set at 0.4a. The inset in ¢ shows an 
example of the electric-field distribution E of a surface state with an in-plane 
wavenumber k,, = 0.252 X(2n/a) for the !—M direction and a frequency of 
f = 0.318c/a (blue circle). PC, photonic crystal. 
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Figure 2 | Experimental demonstrations of the surface states of 3D 
photonic crystals. a, Scanning electron microscope image of the fabricated 
3D photonic crystal (left panel) and the experimental set-up used to couple 
light into the surface modes (right panel). b, Measured dispersion 
characteristics of the surface modes for two representative directions, [—-M 
and I —X). The inset shows one example of the reflection spectrum at 

0 = 52.3° for the [—M direction, in which the minimum value of the 
reflection spectrum at f = 0.382c/a corresponds to the white circle in the 
obtained dispersion curve. ¢, Optical microscope image of the surface of the 
3D photonic crystal, showing the excitation point of the incident light and 
the edge of the photonic crystal. d-f, Experimental results for light 
propagation through the surface mode, in which the irradiating angle 0 was 
fixed at 45.7°, and the wavelength was changed to 1,400 nm (d), 1,430nm 
(e) or 1,460 nm (f). g, Scattered intensity at the edge of the 3D photonic 
crystal as a function of the distance between the excitation point and the 
crystal edge. 


electron microscope image of the fabricated 3D photonic crystal. We 
initially measured the transmission spectra of the sample for various 
incident angles, and confirmed that the fabricated photonic crystals 
had full 3D bandgaps at wavelengths of 1,300-1,600nm (see 
Supplementary Information). We then measured the band structure 
of the surface states in the 3D photonic crystal. When photonic bands 
are formed above the air light-line, direct irradiation of light to the 
photonic crystals'**° can be used. However, in the present case, as the 
surface states are formed below the air light-line, the coupling of light 
to the surface states is difficult using such a direct irradiation method. 
Thus, we used an evanescent-coupling method with a prism*** to 
excite the surface states (Fig. 2a, right-hand side). 

Light with wavelength 2) in vacuum was introduced to the bottom 
face of the prism with a refractive index (Mp;ism) Of 1.5 at an angle of 0, 
which was larger than the critical angle of the total internal reflection. 
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In this situation, the light is completely reflected but the evanescent 
wave can penetrate the bottom face of the prism. We then placed the 
photonic crystal below the bottom surface of the prism at a distance 
of ~500 nm, so that evanescent coupling of the light from the prism 
to the surface modes of the photonic crystal was achieved. The in- 
plane wavenumber k,, of the coupled evanescent wave can be 
expressed as Mprismkosin0, where ko is the wavenumber in the vacuum 
(=21//9). When the evanescent wave is coupled to the surface mode 
of the 3D photonic crystal, the optical power reflected at the bottom 
of the prism is reduced owing to the energy transfer to the surface 
mode of the 3D photonic crystal. By measuring the reflection spectra 
for various incident angles of 0 and finding the frequency at which 
the minimum reflection is realized, we can derive the dispersion 
characteristics of the surface states for various photonic-crystal direc- 
tions. Figure 2b shows the measured dispersion characteristics of the 
surface modes for two representative directions: [—M and I—X,. The 
inset in Fig. 2b shows one example of the reflection spectrum at 
0 =52.3° for the T—M direction, where the minimum value at 
frequency f= 0.382c/a, where c is the velocity of light in vacuum, 
corresponded to the white circle in the obtained dispersion curve. 
The measured results agreed with the theoretical dispersion curve 
(Fig. 1c). These results clearly indicate that surface states are indeed 
formed in the 3D photonic crystal. 

Next, we investigated whether photons could be confined and 
propagate through the surface modes of the 3D photonic crystals. 
We directly observed the propagation of photons via the surface state 
of the 3D photonic crystal. The surface state was excited by the 
evanescent-coupling method described above except that we placed 
the prism below the substrate of the photonic crystal and the light was 
irradiated from the bottom side. Figure 2c shows an optical micro- 
scope image of the photonic-crystal surface observed from the top 
side using an infrared camera. The incident angle 0 was fixed at 45.7°, 
and the wavelength was changed. The measured dispersion curve 
(Fig. 2b) suggested that light with a specific wavelength of 
~1,430nm could be coupled to the surface mode and propagate. 
Figure 2d—f shows the experimental results. Light with a wavelength 
of 1,430nm propagated through the surface of the 3D photonic 
crystal, and was scattered at the end and at the surface (the surface 
scattering being caused by minor, unintentional irregularities of the 
surface of the fabricated crystal) (Fig. 2e). In contrast, light with 
wavelengths of 1,400 and 1,460 nm could not propagate because of 
the lack of surface modes for the incident angle 0 = 45.7° (Fig. 2d 
and f). When the incident angle 0 changed, the wavelength of light 
that could propagate also changed. 

To estimate the propagation length via the surface states, the inten- 
sity of scattered light at the end of the photonic crystal was measured 
by changing the distance between the excitation point and the crystal 
edge. The intensity of scattered light exponentially decreased as the 
distance between the excitation point and the crystal edge increased 
(Fig. 2g). By evaluating the length at which the intensity of scattered 
light became 1/e, where e is the base of the natural logarithm, the 
propagation length was estimated to be ~385 jim. This is reasonable, 
given that the stacked 3D photonic crystal had eight layers. 

These experiments clearly indicate that photons can indeed be 
confined and propagate through surface states. However, the surface 
states were not spatially controlled, so photons inevitably spread in 
various in-plane directions. To manipulate photons arbitrarily at the 
surface of 3D photonic crystals, it is important to realize the spatial 
control and localization of photons at desired surface positions. To 
achieve this, we formed a surface-mode gap, by which the in-plane 
propagation of photons can be inhibited. We could then form the 
desired localized state at a specific position by introducing artificial 
surface defects. The theoretical calculation revealed that changing the 
dielectric arrangement of the surface layer of the 3D photonic crystal 
from a striped to a crossed geometry could achieve a surface-mode 
gap (Fig. 3a). We therefore fabricated 3D photonic crystals with 
modified surface structures in which the additional rod widths were 
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Figure 3 | Formation of a surface-mode gap by modification of the surface 
structure. a, Schematic showing the modification of the surface structure 
from a striped geometry to a crossed geometry by adding additional 
dielectrics. b, c, The measured dispersion characteristics for the photonic 
crystals with a crossed geometric surface. The widths of the additional rods 
were 80 nm (=0.16a) (b) and 200 nm (=0.4a) (c). 


set at 80 and 200 nm, respectively (Fig. 3b and c). The measured 
dispersion characteristics are shown in Fig. 3b and c, respectively. 
It is clear that adding dielectric rods successfully opened the surface- 
mode gap (Fig. 3b and c, white dashed lines). The experimental 
results agreed well with the theoretical calculation. It is also note- 
worthy that the surface-mode gap occurred for both polarizations of 
the transverse electric and the transverse magnetic modes for any in- 
plane direction. 

Our next step was the introduction of an artificial surface-defect 
structure. For this purpose, we partially increased the width of the 
dielectric rod, as shown in Fig. 4a. The calculated electric-field dis- 
tribution of the defect mode revealed that a localized photon mode 
can be formed at the surface of the 3D photonic crystal (Fig. 4b). 
Hence, we fabricated the surface-defect structure shown in the inset 
in Fig. 4c, and measured the optical properties of the defect mode 
using the method shown, in which the excitation light was incident to 
the surface defect with an angle of 45° from the normal direction 
using a 50 X objective lens (numerical aperture = 0.42), and the radi- 
ation from the defect was detected by the same lens. An example of 
the micro-mapping of the response of the artificial defect mode is 
shown in Fig. 4d. The spatially localized photon state indeed formed 
at the surface. Figure 4e shows the series of resonant spectra from the 
artificial surface defects with various lengths Lg. The surface defect 
had a clear resonance curve, and possessed fundamental and higher 
modes. The resonant wavelengths shifted to the longer wavelength 
side as the defect length Lg increased. These results agreed well with 
the theoretical calculation. Figure 4f shows the Q factors of individual 
defect modes. Surprisingly, the highest Q factor reached ~9,000, 
which is the highest value reported for 3D photonic crystals. 
Theoretically, increasing the stacking number of the 3D photonic 
crystals above eight layers could produce Q factors exceeding 100,000. 

This work represents an important step in realizing a new route for 
the manipulation of photons by 3D photonic crystals, as well as 
establishing the surface science of 3D photonic crystals. The experi- 
mental demonstration of a mode gap in the surface states has other 
important implications, such as the protection of interior localized 
states and the efficient coupling of light from an external medium to a 
waveguide within the 3D photonic crystal without loss to undesirable 
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Figure 4 | Artificial surface-defect structure and its characteristics. 

a, Schematic example of a surface-defect structure, in which the width of the 
dielectric rod was partially widened. b, Electric-field distribution E of the 
surface-defect mode, in which the width Wg and the length Ly of the widened 
dielectric rod were 0.52a and 3a, respectively. c, Set-up for measuring 
radiation spectra from surface-defect modes, in which an excitation light was 
incident to the surface defect with an angle of 45° from the normal direction 
using a 50% objective lens (numerical aperture = 0.42), and the radiation 
from the defect was detected by the same lens. The inset shows an example of 
the fabricated surface defect structure, in which Wg and La were 275 nm 
(=0.55a) and 3a, respectively. d, An example of the micro-mapping of the 
response of the surface defect of Lg = 3a at the resonant wavelength of 
1,446 nm. e, Series of resonant spectra from the artificial surface defects with 
various lengths Ly between 1a and 7a. (a.u., arbitrary units.) f, Q factors of 
individual defect modes. The inset shows an example of a resonant spectrum 
for the surface-defect mode with the highest Q factor (~9,000). 


surface states. This work also has interesting analogies to the surface 
plasmon-polariton effect of metals and the related surface photon 
physics. The absorption-free nature of the 3D photonic-crystal sur- 
face means that various new sensing applications can be expected. 
Furthermore, various light—matter interactions could be achieved at 
the surface of 3D photonic crystals. 


METHODS SUMMARY 


The numerical analysis of the surface properties was performed by electromag- 
netic calculations using the 3D finite-difference time-domain (FDTD) method” 
with absorbing boundaries**”’. Photonic crystals with eight stacked layers were 
fabricated by stacking GaAs stripes®"®: initially, striped patterns were formed by 
electron-beam lithography and reactive ion etching in a single crystalline GaAs 


layer on an aluminium (Al)GaAs etch-stop layer and a GaAs substrate. A pair of 
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patterned wafers was stacked and bonded with a crossed configuration using a 
very accurate alignment system’’, and one of the substrates was removed. These 
processes were repeated until the desired structure was constructed. The in-plane 
area of the photonic crystal was set at 400 jtm X 400 um. The resulting structural 
misalignment was <80 nm (<0.16a). For the measurements of the band struc- 
tures (Figs 2b and 3b, c), white light from a halogen lamp with a focal spot of 
~60 jum by a 5X objective lens was used, and the reflection spectra were mea- 
sured by a spectrometer with an indium (In)GaAs multi-channel detector. For 
the measurement of the light propagation via the surface state (Fig. 2d—-f), mono- 
chromatic light from a tuneable continuous-wave semiconductor laser was used. 
The images were observed from the top side using a 20 objective lens and an 
InGaAs charge-coupled device (CCD) camera (320 X 256 pixels). For the mea- 
surements of the optical properties of the surface-defect modes, white light from 
a super-continuum source with a focal spot of ~4 um by a50X objective lens was 
used, and the radiation spectra were measured using a high-resolution spectro- 
meter. The Q factors were estimated using the formula Q = A9/AAo, where Jp and 
Ado are the measured central wavelength and the full-width at half-maximum 
(FWHM) of the radiation peaks, respectively. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Numerical calculations of band structures and surface-defect modes. The 
numerical calculations were performed using the 3D FDTD method”. For the 
structure without the surface (Fig. 1b), the Bloch’s periodic boundary was used 
for all directions. A unit cell of the calculation was set to be a X a X 4h, where ais the 
centre-to-centre separation of the parallel dielectric rods and his the thickness of the 
rod. The refractive index n of the dielectric rods was set to be 3.375, based on GaAs in 
the optical communications wavelength range. The mesh size for the FDTD calcu- 
lations was set to be a/16 for both the x-direction and the y-direction, and a/15 for 
the z-direction. The shadowed band areas shown in Fig. 1b were derived by project- 
ing the modal frequency of the 3D wavevector k to their in-plane wavevector k,. For 
the structure with the surface (Fig. 1c), the Bloch’s periodic boundary was used for 
the in-plane directions and the Mur’s second-order absorbing boundary”® was used 
for the vertical direction. The mesh size was set to be a/16 for all directions. The 
whole structure was constructed with an air region (1 = 1), a 3D photonic crystal 
and a dielectric material as a substrate (n = 3.375), forming the surface modes on 
only one side of the photonic crystal edge, which is the interface between the 
photonic crystal and the air region. The electromagnetic field distribution (inset 
in Fig. 1c) was also calculated using this model, with the in-plane wavenumber k,, 
fixed at a specific value of: 0.25)2 X (2n/a) for the PM direction. When calculating 
the properties of the surface-defect mode, a structural model with a photonic crystal 
having an in-plane size of 16a X 16a and a dielectric substrate was considered. The 
absorbing boundary of the Berenger’s perfectly matched layer’’ was used in three 
dimensions, and the mesh size used for the calculation was set at a/16. 
Measurements of band structures and light propagation via the surface state. 
The evanescent-coupling method with a prism?! was used to excite the surface 
modes. For the measurements of the band structure of the surface modes, a prism 
with a refractive index Mp,ism Of 1.5 was positioned above the surface with an air 
gap of ~500 nm (Fig. 2a). A 5X objective lens (numerical aperture = 0.14) was 
used to focus the incident light onto an area with a diameter of ~60 lm, which 


nature 


was sufficiently smaller than the area of the fabricated photonic crystal 
(400 um X 400 um). In the measurement using the prism of Mpyism = 1.5, the 
critical angle 0. of the total reflection at the bottom face of the prism was 
41.8°. We used various values of the incident angle 0 (43.0°, 44.3°, 45.7°, 
47.0°, 48.3°, 50.3°, 52.3°, 54.9°, 57.5°, 60.1° and 64.5°) to change the in-plane 
wavenumber kj. For each angle, the reflection spectra were measured using a 
spectrometer in the wavelength range of 1,140 to 1,650 nm. The inset in Fig. 2b 
illustrates an example of spectrum obtained in this way. The band structures 
shown in Fig. 2b and Fig. 3b and c were obtained from the reflectance spectra of 
all values of angle 0, where the results for two polarizations in which the vectors 
of the electric field were parallel and perpendicular to the plane of incidence, 
namely the P and S polarizations, were superposed. 

For the measurement of the light propagation via the surface mode (Fig. 2c—f), 

the position of the prism was changed and replaced at the bottom of the GaAs 
substrate. The irradiated light was from a semiconductor laser with a wavelength 
that could be tuned from 1,270 to 1,630 nm, and the images were detected by an 
infrared camera. For the estimation of the propagation length, the light intensity 
scattered at the edge of the photonic crystal was measured and the background 
intensity was subtracted. The scattered intensity values were plotted for various 
distances between the edge and the excitation point (Fig. 2g), and the relation 
between the distance and the intensity was fitted using an exponential function 
by least-squares approximation. 
Measurements of the characteristics of surface-defect modes. These measure- 
ments were performed using the method shown in Fig. 4c, in which the surface- 
defect modes were excited by white light generated by a super-continuum 
source. To eliminate the influence of the light reflected directly from the surface 
of the 3D photonic crystal, the irradiating angle was set to be 45°. Furthermore, 
two sets of polarizers with polarization directions that differed by 90° were 
inserted in front of the light source and the detector, respectively, to eliminate 
the background noise as much as possible. 
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Photoconductance and inverse photoconductance in 
films of functionalized metal nanoparticles 


Hideyuki Nakanishi’, Kyle J. M. Bishop’, Bartlomiej Kowalczyk’”, Abraham Nitzan'”, Emily A. Weiss’, 
Konstantin V. Tretiakov?, Mario M. Apodaca’, Rafal Klajn’, J. Fraser Stoddart’ & Bartosz A. Grzybowski’” 


In traditional photoconductors’’, the impinging light generates 
mobile charge carriers in the valence and/or conduction bands, 
causing the material’s conductivity to increase*. Such positive 
photoconductance is observed in both bulk and nanostructured*® 
photoconductors. Here we describe a class of nanoparticle-based 
materials whose conductivity can either increase or decrease on 
irradiation with visible light of wavelengths close to the particles’ 
surface plasmon resonance. The remarkable feature of these plas- 
monic materials is that the sign of the conductivity change and the 
nature of the electron transport between the nanoparticles depend 
on the molecules comprising the self-assembled monolayers 
(SAMs)’”* stabilizing the nanoparticles. For SAMs made of elec- 
trically neutral (polar and non-polar) molecules, conductivity 
increases on irradiation. If, however, the SAMs contain electrically 
charged (either negatively or positively) groups, conductivity 
decreases. The optical and electrical characteristics of these prev- 
iously undescribed inverse photoconductors can be engineered 
flexibly by adjusting the material properties of the nanoparticles 
and of the coating SAMs. In particular, in films comprising mix- 
tures of different nanoparticles or nanoparticles coated with 
mixed SAMs, the overall photoconductance is a weighted average 
of the changes induced by the individual components. These and 
other observations can be rationalized in terms of light-induced 
creation of mobile charge carriers whose transport through the 
charged SAMs is inhibited by carrier trapping in transient 
polaron-like states”'®. The nanoparticle-based photoconductors 
we describe could have uses in chemical sensors and/or in con- 
junction with flexible substrates. 

We used gold and silver nanoparticles (diameter 5.6 + 0.8 nm, 
with surface plasmon resonance (SPR) maxima in solution of 
respectively AA" ~520nm and 248. ~420nm) synthesized as 
described before'''’; these were stabilized by monocomponent or 
mixed SAMs of alkane thiols (Fig. la) abbreviated as C3;0OH, 
CsOH, C);OH, CjpCOOH, C\;NMe3", Cs, PhOH and PhCOOH. 
The suspensions of the nanoparticles were drop-cast onto a glass or 
Teflon substrate, and the solvent was evaporated under vacuum to 
give nanoparticle films of thickness (h) 120-300 nm (determined by 
profilometry for each sample), and characterized by SPR bands red- 
shifted (by ~50-60nm) and broadened owing to nanoparticle 
aggregation’*"* (Fig. 1c). Next, Pd/Au electrodes (width w= 5mm, 
separated by a gap of L=50 um; Fig. 1b) were sputtered onto the 
films, which were then dried under high vacuum for at least one week 
to remove traces of solvent and water. The samples were placed in a 
custom-made, hermetically sealed Faraday cage under inert atmo- 
sphere and in the presence of phosphorus pentoxide, a dehydrating 
agent. The electrodes were connected to a Keithley 6517 electrometer, 
a white-light LED (light-emitting diode) was placed above the film, 


and optical bandpass filters were used to select the desired wavelength 
of light. Unless otherwise noted, the temperature during all experi- 
ments was kept constant to within 0.02 °C (for further experimental 
details, see Supplementary Information section 1). 

In the absence of irradiation, both Au and Ag films exhibit ohmic 
current density—applied field (j-E) characteristics in the field intens- 
ity range from 100 Vm! (corresponding to 0.005 V across the sys- 
tem and sub-picoamp currents at the sensitivity limit of the 
electrometer) to 20kV m7! (higher fields cause irreversible changes 
in the nanoparticle films and coalescence of the neighbouring nano- 
particles). 

Irradiation with visible light alters the currents through the nanos- 
tructured films (Fig. 2). We note that the signs of the observed 
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Figure 1 | Functionalized nanoparticles for photoconductive films. 

a, Structures of alkane thiols used to form SAMs on gold and/or silver 
nanoparticles. b, Diagram of the experimental set-up (not to scale). The 
thickness of the nanoparticle layer is ~120—300 nm. Thinner layers give 
currents of magnitude comparable to the external/instrument noise. 

c, Ultraviolet/visible spectra of Au/C,OH (top) and Ag/C,OH (bottom) 
nanoparticles in solution and when cast onto glass films. Note that owing to 
aggregation of the nanoparticles'”"’, the spectra of the particles in the films 
are broadened and red-shifted compared to those of free particles in 
solution. The degree of this spectral shift increases with decreasing thickness 
of SAMs separating the metal cores of the nanoparticles (for example, 

JA" ~ 580 nm for Au/C.OH films but Hen = 770nm for Au/C3OH films). 
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Figure 2 | Photoconductance and inverse photoconductance of irradiated 
nanoparticle films. a, Change in current density on irradiation, Aj, for 
various types of nanoparticle films irradiated periodically with white LED 
light. Key specifies metal cores and coating thiols (Fig. 1a). Control 
experiments with thin films of thermally evaporated Au or Ag show no 
conductivity changes on irradiation. b, Dependence of Aj on time t 
(immediately after the irradiation commences) for a series of uncharged 
thiols differing only in chain length. Inset, linear regions in the semi-log plot 
correspond to an exponential Aj(t) dependence; slopes equal the inverse of 
T,. €, T, decreases approximately linearly with decreasing thickness, d, of 
SAMs separating nanoparticle metal cores; d was estimated from molecular 
modelling and, independently, from TEM images of the films (methods 
agreed to within 15%). Errors in the times are of the order of a few per cent 


changes depend on the chemical nature of the thiols coating the 
nanoparticles but not on the nature of the particles’ metal cores. 
Figure 2a shows typical variations in the relative current density, 
Aj = (irr — jo)/Jo Yo and jir, indicate current density in the respective 
absence and presence of irradiation), recorded for different types of 
nanoparticle films (all at E= 20kV m_' and for nanoparticles coated 
with monocomponent SAMs) periodically exposed to white LED 
light of intensity [ynite = 60 W.W cm” *. When the thiols are electric- 
ally neutral, current density increases on irradiation, Aj > 0. When, 
however, the thiols contain electrically charged groups, Aj<0. 
Decreasing current densities are observed in films composed of either 
positively charged nanoparticles (coating thiol C;, NMe;*) or nega- 
tively charged nanoparticles (deprotonated thiol C,)COO ). 
Interestingly, films comprising nanoparticles stabilized with proto- 
nated thiols C}yCOOH exhibit positive Aj changes (compare curves 3 
and 5 in Fig. 2a). 

When light is switched on or off, Aj(t) depends on time t approxi- 
mately exponentially, Aj(t) « (—¢/t,), where t, denotes a character- 
istic ‘switching’ time, which is the same for ‘on’ and ‘off exponents, 
does not depend on the intensity of light, and is of the order of 
hundreds of milliseconds. For a given terminal functionality of the 
thiol molecules comprising the SAMs, the switching times increase 
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(~7%) based on measurements of five cycles of irradiation for at least three 
independent samples. d, Irrespective of E, and for a given value of I (here, 

60 uW cm 7), the changes in photocurrent |Aj(/)|,; are maximal when / is 
close to the SPR maximum of the nanoparticles in the film. Violet symbols, 
Au/C.OH films (maximal | Aj(/)|; at around 7° = 580 nm); red symbols, 


Ag/C,,NMe;* films (maximal | Aj(A) |r at JA8 "470 nm); triangles, 


“max 
370-440 nm; circles, 410-500 nm; squares, 490-590 nm; inverted triangles, 
640-780 nm. Inset, absolute values of current density illustrating linear 
dependence of j;,. — jo on E. e, For constant E (here, 20 kV m_!) and for 
different wavelengths, 4, relative current densities scale linearly with the 
intensity of irradiating light, Aj(I) « I. Symbols and wavelength ranges as in 
d. For different material samples (>5 for each condition/material tested), 
the values of t,. and Aj in c—e agree to within less than 6%. 


approximately linearly with increasing chain length (Fig. 2b and c). 
For identical SAMs, the switching times increase with decreasing 
nanoparticle size (for example, t, ~ 0.29 s for Au/C,OH with metal 
core diameter 5.6 nm versus tT, ~ 0.18 s for 10.0-nm Au/C,OH part- 
icles). We note that in all cases the values of t, are not related to the 
intrinsic current ‘response’ times of the measuring device (<1 ms) 
accompanying the step-wise change in the applied field. 

A second set of observations relates the magnitudes of the relative 
current changes, |Aj], to the field strength, E, the wavelength, 1, and 
the intensity, J, of the irradiating light. For all films studied and all 
values of 2, we find that both the “dark current and the light-induced 
change are linear in E, such that the relative light-induced change 
|Aj(E)|,, is independent of E (Fig. 2d). Furthermore, at constant 4, 
|Aj(1)|, is linear in the light intensity I (up to about tens of tW cm’; 
for higher intensities, photocurrent effects are masked by increasing 
temperature of the film). Figure 2e illustrates this effect for Au/C;0H 
and Ag/C,,NMe;~ films irradiated with light of intensity up to 
I=8Wcm ”. The corresponding slope depends on the nature of 
the nanoparticles/SAM and on the wavelength /. Finally, for all values 
of Eand I, |Aj(/)|,.; is maximal when / coincides with the SPR of the 
aggregated nanoparticles’ metal cores. Thus, for Au- and Ag-based 
films, |Ajj is maximal when the wavelength is close to 24°. = 580 nm 
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and /8 = 470 nm, respectively (Fig. 2d and e). For films comprising 
both Au and Ag particles, the superposition of the two SPR peaks 
results in maximal current changes over a wider range of wave- 
lengths, 2 ~ 400-650 nm. In all of the above experiments, the largest 
current density change for uncharged thiols is ~6% for Au/CsO0H 
films. For charged thiols, changes as large as —27% are observed for 
films composed of Ag/C,,;NMe3~ nanoparticles (compare Fig. 2a). 
Both positive and inverse photoconductance can be explained 
qualitatively by a model (Supplementary Information sections 2, 3) 
in which the nanoparticle array is characterized by a density N, of 
discrete ‘trap sites’ of which n, are filled at thermal equilibrium. 
Physically, we suggest that these traps are of two types: (1) sites on 
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Figure 3 | Energy diagrams and temperature dependence of responses. 

a, Left panel: qualitative electronic structure diagram for nanoparticles 
capped with uncharged ligands. Yellow bars, nanoparticles’ metal cores 
characterized by Fermi energies, Ep, which in the absence of irradiation 
define an approximate border between occupied and unoccupied states. On 
irradiation, electrons are promoted into excited metal states, ~2 eV 
above”!°! Ey. Molecular levels of the non-polar alkane thiols lie ~3—5 eV 
above’*”° Ey. Right panel: current though a film of Au/C,,OH nanoparticles 
as a function of inverse temperature (differences in slopes recorded for 
different material samples are within 4% of the average). Because 

In(jo) ~ —E,/kgT + constant (ref. 26), the slope (E,) corresponds to the 
amount of energy that the environment contributes to the charge transport 
process (probably a mixture of thermally activated hopping and 
temperature-independent tunnelling). b, For nanoparticles capped with 
charged ligands, the scenario presented in a is augmented by the presence of 
polaron-like states within the organic layer separating the nanoparticles. The 
existence of these states is supported by temperature dependent current 
measurements (right; Au/C,;NMe;_, error in the slope <4%) showing that 
charge transport through arrays of charged nanoparticles is negatively 
activated (increasing the temperature decreases the current for a given 
applied voltage). Negative activation is common in charge transport 
systems” where an equilibrium exists between two conformational states of 
a donor-bridge-acceptor complex. Here, the molecular orbitals comprising 
the charged moieties (that is, the bridge) are characterized by two 
conformations (left), one of which (B;) is more favourable for electron 
transfer than the other (Bz). Negative activation occurs when raising the 
temperature pushes the conformational equilibrium towards the 
unfavourable geometry (B,) (Supplementary Information section 2). The 
observed activation barrier, Exbs =E,+AG = —0.9 eV, can be used to 
estimate the difference in free energy, AG, between the two bridge 
conformations—here, AG ~ —1 eV. Also, the energy of bridge states 
corresponding to the polar groups before polaronic reorganization is ~2 eV 
below those of the alkyl chains (refs 27-29; Supplementary Information 
section 2). Thus, in the presence of light irradiation, the lower energy bridge 
state (Bz) can act as a trap for charge carriers, resulting in a negative 
photocurrent (main text). 


LETTERS 


the nanoparticle ‘cores’, and (2) sites on the organic ligands. The 
former arise because the Fermi energy of the nanoparticles is several 
eV below’? that of the alkyl chains separating the particles. Therefore, 
each nanoparticle can accommodate a certain number of electrons 
depending on its charging energy, which in turn depends on the 
nanoparticle radius and capacitance’®'’. In reality, charge carriers 
‘trapped’ on the cores are delocalized over the particles’ surface; 
however, for simplicity we treat each core as a collection of discrete 
trap sites, each of which can accommodate a single charge carrier. 
Additionally, the organic ligands separating the nanoparticle cores 
may also act as traps if (as in the case of charged thiols discussed 
below) their conduction orbitals are of lower energy than the plas- 
mon state of the nanoparticles. Within the framework of this simple 
but widely used trapping model'*!’, the effective equilibrium 
between free and trapped carriers (1 and n,, respectively) is given by: 


n[N, — m1] 


nN 


=K (1) 


where K is an effective equilibrium constant characterizing the den- 
sity of free and trapped carriers at the steady state (Supplementary 
Information section 3.4). In this way, the density of traps determines 
the density of free charge carriers, n, and thereby the conduction 
current, j = —enE, where e is the fundamental charge, and 1 is the 
mobility of the charge carrier in the nanoparticle medium. 

For arrays of nanoparticles capped with uncharged ligands, the 
only trap sites are those on the nanoparticle cores (see Fig. 3a), and 
electrons tunnel from nanoparticle to nanoparticle through a large 
barrier imposed by the insulating organic SAM. This barrier is 3-5 eV 
(refs 15, 20) relative to the Fermi energy of gold (—5.1 eV; ref. 15) or 
silver (—4.3 eV; ref. 15). Starting from the steady-state current in the 
dark, photoexcitation at or near the plasmon resonance of the nano- 
particles (Fig. 2d, e) promotes electrons from the valence band to the 
conduction band of the nanoparticles. Subsequently, the photoex- 
cited electrons ‘fill available trap sites present on the nanoparticles, 
thereby reducing the effective density of these sites by 
AN, = Nit —N,<0. Through the effective equilibrium relation (1), 
the decrease in N, acts to increase the density of free carriers, n, and 
produces a positive Aj in the uncharged-SAM systems. 

For arrays of nanoparticles capped with charged SAMs, the traps 
on the nanoparticle cores are supplemented by trap sites on the 
organic ligands (see Fig. 3b). These additional traps probably arise 
due to the formation of polaron-like states via the so-called bootstrap 
mechanism, whereby electron localization occurs through the reor- 
ganization of the nuclear environment of the ionic moieties in the 
presence of an electron”’®*'. The degree of energetic stabilization of 
these polaronic traps via nuclear reorganization was estimated experi- 
mentally at ~1.0eV from the temperature dependence of the con- 
duction current (Fig. 3b right, and Supplementary Information 
section 2). In the dark, the energy of these states is expected to lie 
above the Fermi energy of the nanoparticles, and the organic ligands 
do not act as trap sites but rather as part of the tunnelling barrier 
(Fig. 3b left). On irradiation, however, excitation at the plasmon 
resonance of the nanoparticles promotes charge carriers to an injec- 
tion energy that is ~2 eV above their injection energy in the dark?" 
Relative to this excited state, the charged ligands become effective trap 
sites on irradiation, thereby increasing the density of traps AN, > 0 
(see Supplementary Information section 3.4 for an alternative but 
ultimately equivalent kinetic interpretation). Consequently, the den- 
sity of free carriers, n, decreases in accordance with equation (1), and 
the current decreases to give a negative Aj in the charged systems. 

We note that the formation of polaron-like traps cannot be affec- 
ted by the electric field produced by the oscillating electron density of 
the plasmon on the nanoparticles (Supplementary Information sec- 
tion 4). Conduction electrons on the nanoparticles excited at or near 
the plasmon resonance oscillate at optical frequencies (~10'* Hz) 
and cannot couple to molecular motions/structural rearrangement 
within the organic SAMs. Furthermore, the d.c. fields produced by 
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these oscillations are, for the particle sizes used here, vanishingly 
small, and so cannot align dipoles within charged SAMs””. 

An interesting manifestation of the two trapping scenarios is seen 
when the films contain both uncharged and charged ligands. 
Figure 4a plots the results of a series of experiments with films com- 
prising (1) binary mixtures of charged and uncharged nanoparticles 
(here, Au/C,,NMe3* and Au/C,,OH; blue squares) and (2) nano- 
particles coated with mixed SAMs” of charged and uncharged thiols 
(C,;NMe;* and C;,OH; yellow circles). For both types of system, 
the overall relative current changes can be expressed as a linear 
combination of the current changes of monocomponent films, 
Aj= XxouAjou + %nme3+ Ajnmes+> Where Zou and %nmMe3+ = 
1 — You are the fractions of either nanoparticles of each type in the 
film or of different thiols on the nanoparticle surfaces. These linear 
dependences imply that the effects of polaronic-like centres on car- 
rier trapping are additive, such that the number of trapping sites 
introduced on irradiation increases linearly with the number of 
charged moieties in the material (Supplementary Information 
section 3.3.vi). Interestingly, at a particular ratio yoy ~ 0.6, the 
increase in the number of mobile carriers created by photoexcitation 
is exactly offset by the increase in the number of traps due to charged 
thiols, such that the effective number of trapping sites remains 
unchanged on irradiation and the photocurrent is identical to the 
dark current, Aj= 0. 
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Figure 4 | Conductance of mixed films and theoretical predictions. 

a, Typical linear dependence of Aj on the composition 7 of the nanoparticle 
films. Blue squares, films composed of Au/C,,OH and Au/C,;NMe3~ 
nanoparticles (NPs). Yellow circles, films comprising Au nanoparticles 
coated with mixed SAMs of C,,OH and C,,NMe;" thiols. Correlation 
coefficients: 7” = 0.982 for nanoparticle mixtures, r = 0.976 for mixed-SAM 
nanoparticles, r° = 0.962 for all data points; errors based on the analysis of at 
least seven samples per condition are less than ~10% of the averages shown. 
Similar linear trends are observed for other thiol pairs studied. Dashed line 
(fit) indicates that this linear trend is also predicted by the theoretical model 
(see main text). b, Positive and negative photocurrent response as calculated 
by charge transport simulations (Supplementary Information section 3). 
The current change on irradiation (or ceasing of irradiation) is well 
approximated by an exponential saturation to the new steady-state current 
with a switching time characteristic of the trapping kinetics (here, t, = 0.5 s). 
Inset, the magnitude of the photoresponse is linearly proportional to the 
light intensity. c, Experimentally recorded t, as a function of Aj for Au/ 
C,,;NMe3~ and Au/C,OH nanoparticle arrays using light irradiation of 
different intensities. Solid lines, theoretical fits from the charge transport 
model, illustrating that t, is largely independent of the magnitude of Aj but 
depends on the nature of the ligands (here t, ~ 1.3 s for Au/C; iNMe;* and 
T, ~ 0.348 for Au/C,OH). d, Photocurrent response for Au/C,;OH 
nanoparticles for two different electrode separations, L = 50 1m and 

400 itm. The switching time does not depend on L (in agreement with 
theoretical prediction) but only on the nature of the ‘trap sites’ within the 
nanoparticles and, in the case of charge ligands, within the organic layer. 
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To gain further insight into the response of the material to light 
irradiation, it is necessary to account for the kinetic processes of trap- 
filling or trap-emptying. Starting from the charge transport equa- 
tions (Supplementary Information section 3), the positive and nega- 
tive changes in the current density on photoexcitation can be related 
to the changes in the total number of trap sites (from N, in the dark to 
N’, on irradiation) and the concomitant changes in the number of 
free charge carriers (from 1 to n’) subject to equation (1). It can be 
shown that in the absence of significant space charge injection’*"”, the 
transport equations can be simplified to give the following equation 
relating changes in n to the trapping kinetics: 


dn dn, 

dt = dt 
where k is a rate constant for trapping. In the limit of N,>n, this 
equation is linear and predicts the exponential saturation of n and n, 
to their new steady-state values: importantly, as j «x n, the model 
reproduces the exponential response of the system when light is 
switched on or off and also the linear dependence of j on the light 
intensity (Fig. 4b). Furthermore, 1,~1/kN, (Supplementary 
Information section 3), so that the switching timescale observed in 
experiments (0.1-1s) is equivalent to that associated with charge 
trapping. This timescale is commensurate with but slightly faster 
than the 10-100 s trapping times previously found for charge reten- 
tion within gold nanoparticle transistors. The model also correctly 
predicts that for a given type of SAM, the switching time depends 
neither on Aj (Fig. 4c) nor on the width of the gap between the 
electrodes (Fig. 4d). Finally, the model reproduces the linear depend- 
ence of t, on the SAM thickness (compare Fig. 2c) and its decrease 


= k[—n(N',—m)+ Kn] (2) 
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Figure 5 | Modulation and ‘switching’ of photoconductance. a, Exposure of 
a film of Au nanoparticles coated with COOH thiols to the vapours of 
ammonium hydroxide causes partial deprotonation of carboxylic acid 
groups and decreases the photocurrent; exposure to vapours of hydrochloric 
acid re-protonates these groups causing the photocurrent to increase. As the 
current does not switch to negative values observed for nanoparticles coated 
with fully deprotonated thiols (Fig. 2a), the deprotonation is probably only 
partial and/or limited by penetration of the film by the vapours’ molecules. 
Note that this experiment is performed in the presence of humidity, and the 
photoconductance effect is still observed. In fact, we verified that exposure of 
the film to neutral water vapour (relative humidity RH < 30%) does not 
alter photoconductance. b, Switching from positive to negative (inverse) 
photoconductance in a film of Au nanoparticles coated with ligands 
terminated in tetrathiafulvalene (TTF; structural formulae shown). The 
effect can be attributed to the TTF > TTF’* oxidation (blue, neutral TTF; 
red, TTF dication) previously reported to occur at ~2 V (ref. 30). 
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with increasing nanoparticle size (because as the volume fraction of 
gold within the film decreases, the density of traps within the array 
decreases). 

We have described a class of nanostructured materials that exhibit 
photoconductance as well as previously undescribed inverse photo- 
conductance. From a fundamental perspective, the present work 
provides new insights into electron transport in heterogeneous 
nanostructured media in which photoexcitation and trapping of 
charge carriers can be spatially separated between metallic nanopar- 
ticle and insulating SAM domains. Incorporation of molecules that 
can be interconverted between charged and uncharged forms could 
enable external control of the degree of photoconductance (for 
example, by chemical means, Fig. 5a) or even switching between 
normal and inverse photoconductance modes (for example, by the 
applied bias, Fig. 5b). 
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Evidence for middle Eocene Arctic sea ice from 
diatoms and ice-rafted debris 


Catherine E. Stickley’*, Kristen St John’, Nalan Koc’”, Richard W. Jordan‘, Sandra Passchier’, Richard B. Pearce® 


& Lance E. Kearns” 


Oceanic sediments from long cores drilled on the Lomonosov ridge, 
in the central Arctic’, contain ice-rafted debris (IRD) back to the 
middle Eocene epoch, prompting recent suggestions that ice 
appeared in the Arctic about 46 million years (Myr) ago”. 
However, because IRD can be transported by icebergs (derived 
from land-based ice) and also by sea ice*, IRD records*’ are 
restricted to providing a history of general ice-rafting only. It is 
critical to differentiate sea ice from glacial (land-based) ice as 
climate feedback mechanisms vary and global impacts differ 
between these systems: sea ice directly affects ocean—atmosphere 
exchanges’, whereas land-based ice affects sea level and conse- 
quently ocean acidity®. An earlier report’ assumed that sea ice 
was prevalent in the middle Eocene Arctic on the basis of IRD, 
and although somewhat preliminary supportive evidence exists’, 
these data are neither comprehensive nor quantified. Here we show 
the presence of middle Eocene Arctic sea ice from an extraordinary 
abundance of a group of sea-ice-dependent fossil diatoms 
(Synedropsis spp.). Analysis of quartz grain textural characteristics 
further supports sea ice as the dominant transporter of IRD at this 
time. Together with new information on cosmopolitan diatoms 
and existing IRD records’, our data strongly suggest a two-phase 
establishment of sea ice: initial episodic formation in marginal 
shelf areas ~47.5 Myr ago, followed ~0.5 Myr later by the onset of 
seasonally paced sea-ice formation in offshore areas of the central 
Arctic. Our data establish a 2-Myr record of sea ice, documenting 
the transition from a warm, ice-free® environment to one domi- 
nated by winter sea ice at the start of the middle Eocene climatic 
cooling phase’. 

Current thinking suggests that the Cenozoic cryosphere evolved in 
phases, starting in the late—-middle Eocene with the presence of small, 
isolated ice caps in both hemispheres*””, and interrupted by at least 
one short warming event'’ before major ice-sheet expansion on 
Antarctica and concomitant deep-sea cooling in the late Eocene 
and early Oligocene’’. Much of the focus has been on land-based 
ice, but sea ice is an integral part of past and present cryospheric 
systems as it is involved in important climate feedbacks, such as 
changes in the global energy budget through the albedo effect>*. 
Yet relatively little is known of the temporal and geographic distri- 
bution of sea ice before the Quaternary, largely because its effects are 
expressed more subtly and, arguably, less widely in the geological 
record than those of land-based ice. Part of the problem is that 
transient or localized sea ice may leave no obvious sedimentary 
record, or that its effects are easily overlooked. Application of one 
of the conventional methods for tracing past sea ice, using sea-ice- 
associated diatoms in marine sediments’’, is less obvious for the 
pre-Quaternary and is certainly challenging for the current patchy 


Eocene diatom record, particularly in Antarctica (Supplementary 
Information). 

In the Arctic, pre-Quaternary sea-ice patterns are beginning to be 
understood through the Integrated Ocean Drilling Program 
Expedition 302 (‘ACEX’), which recovered the first long Cenozoic 
palaeoceanographic record from the Lomonosov ridge in the central 
Arctic’** (Fig. 1). It is now known, for example, that episodic 
perennial sea ice first appeared in the middle Miocene'*’*. Here we 
present evidence for middle Eocene seasonal sea ice from a ~2-Myr- 
long record of fossil sea-ice diatoms and sea-ice-dominated IRD; this 
record is contained within a ~36-m interval of finely laminated, 
organic-rich, biosiliceous sediments’'® that represent the start of 
Cenozoic cooling following the Early Eocene Climatic Optimum’ 
(Supplementary Fig. 1). These sediments contain abundant marine 
and freshwater siliceous and organic-walled microfossils’'*"*’, 
including diverse and largely endemic diatom assemblages character- 
ized by heavily silicified marine Palaeogene taxa such as Hemiaulus 
spp.’"° and Anaulus arcticus"*. 


Pacific Ocean 


ACEX@arill sites 


proto-Atlantic 
Ocean 


Tethyan Ocean 


Figure 1| Idealized palaeogeography of the Arctic region for the early 
middle Eocene during the phase of biosilica production and preservation at 
the Lomonosov ridge (~50-45 Myr ago). Redrawn from ref. 17 with the 
difference that the Turgay Strait (outlined) is essentially closed to any major 
exchange at this time, based on siliceous microfossil data (Supplementary 
Information). 
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Above 260.30m composite depth (m.c.d.), weakly silicified, 
araphid, needle-shaped pennate diatoms Synedropsis spp. (Figs 2 
and 3; Supplementary Figs 2—4) are preserved in abundances typically 
exceeding 25%, but reaching as high as ~61% of the diatom assem- 
blage. Previously overlooked in initial investigations’, these are some 
of the most delicately silicified diatoms discovered in the fossil 
record, with preservation so remarkable that micro-aggregate and 
colony integrity is commonly still intact (Supplementary Infor- 
mation). Sediment trap studies show that most diatom valves do 
not reach the sea floor because normal sea water is under-saturated 
in silica, making opaline (amorphous hydrous) silica—that impreg- 
nates the diatom cell-wall—highly susceptible to dissolution and 
biasing diatom records towards relatively large or heavily silicified 
types’. Weakly silicified diatoms are preserved only under excep- 
tional circumstances, and are particularly unusual for Palaeogene 
sediments. In this respect, the ACEX diatom record is unprecedented 
and reflects a most unusual environmental setting. We attribute the 
preservation of Synedropsis spp. in these sediments to the presence of 
sea ice and silica-enriched waters. 

Today, colonies of extant Synedropsis spp. are peculiar to polar sea 
ice*® (Supplementary Information). The ACEX fossil Synedropsis spp. 
are uniquely associated with the middle Eocene IRD record from the 
same cores (Fig. 3), and here we demonstrate that this IRD record is 
predominantly derived from sea ice. On the basis of this association 
and by analogy with living Synedropsis spp., including the manner in 
which they are preserved (Supplementary Information), we propose 
that the ACEX Synedropsis spp. were, like their modern counterparts, 
sea-ice-dependent and, as such, highly specialized extremophiles, 
uniquely adapted for surviving the lengthy polar darkness and freez- 
ing temperatures. These diatoms provide the most compelling evi- 
dence for ancient sea ice, as they rely on this medium for their 
survival. Furthermore, the ACEX Synedropsis spp. represent the earli- 
est known fossil record of sea-ice diatoms (Supplementary 
Information). Also we found no reports in the literature of 
Synedropsis spp. (or of any synonyms” or misidentifications) in 
pre-Holocene sediments from elsewhere in the Arctic and subarctic 
regions, suggesting their geographic and stratigraphic limitation. 

The presence of Synedropsis spp. in the ACEX sediments shows that 
non-permanent (for example, seasonal) sea ice existed over the 


Figure 2 | SEM image of part of an aggregate of near-whole needle-shaped 
Synedropsis sp. valves. Imaged along a lamina-parallel fracture 
(topographic) surface of unprocessed core material at 240.90 m.c.d. (302- 
2A-55X5, 19.5 cm; 46.16 Myr ago). The main part of the image shows most 
valves are missing just the apices. Valves that have been fractured in several 
places are visible in the lower left and upper right corners of this image. Scale 
bar, 3 pum. Images of colonies, aggregates, and details of the apices are shown 
in Supplementary Figs 3 and 4. 
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Figure 3 | IRD and sea-ice diatom abundance in the ACEX cores. Shown are 
data for the >250 um (red) and 150-250 um (green) quartz fractions, and 
the relative abundance (percentage of total diatoms) of needle-shaped 
Synedropsis spp. (blue). Over the age range shown, shallow exchange 
between the Lomonosov ridge and the proto-Norwegian-Greenland sea 
remains possible year-round until 46.97 Myr ago, but becomes seasonally 
restricted at younger ages. FO, first occurrence; FAO, first abundant 
occurrence; LO, last occurrence; LR, Lomonosov ridge. IRD data are from 
ref. 2. Age model is from ref. 13. 


Lomonosov ridge, and that regular (for example, annual) melting 
episodes, into silica-enriched waters, permitted their preservation. It 
is likely that sea ice formed in autumn and winter and melted in spring 
and summer, as seasonal sea ice does today. We suggest that 
Synedropsis spp. over-wintered within the sea ice and later bloomed 
there after irradiance increased in the following spring. Following sea- 
ice melt, they were released into stratified surface waters where 
aggregation was undoubtedly intensified by virtue of their stickiness”! 
and needle shape. As aggregation enhances sinking rates, these were 
unlikely candidates for seeding the melt zone’’. We therefore suggest 
they did not constitute a significant part of the summer growth along- 
side other diatom taxa. Instead, they underwent rapid flux to the sea 
floor where they remained largely undisturbed under an anoxic 
benthic environment'®”. Such sea-ice-driven seasonality is evident 
from our preliminary investigations of the laminations that indicate 
Synedropsis spp. form regularly occurring, near-monogeneric lamina- 
tions, commonly associated with IRD. This indicates that production 
and flux of sea-ice diatoms occurred at a different stage in the annual 
cycle to that of the other ACEX diatoms, and that Synedropsis spp. 
laminations probably represent episodes of spring or summer sea-ice 
melt and flux, a hypothesis necessitating examination of the lamina- 
tion sequences through the core. 

The current estimate for the initiation of ice in the Arctic is 
~46.25 Myr ago’ (Fig. 3), based on IRD evidence only, in particular 
on the doubling of sand-sized IRD flux to 0.02 g¢cm™*kyr' and 
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establishment of a quasi-cyclicity*. However, IRD exists in the ACEX 
cores as far back as ~47.5 Myr ago? (Fig. 3), albeit in low abundance. 
Synedropsis spp. appear in the record 46.97 Myr ago, thereafter co- 
varying with IRD throughout the biosilica-rich core sections 
(Supplementary Fig. 2). The appearance of Synedropsis spp. in the 
ACEX cores coincides with the disappearance of a cosmopolitan 
(warmer water) diatom component characterized by Porotheca 
danica and Pterotheca aculeifera (Fig. 3), whereas other endemic taxa 
persist throughout the cores. Considering that the IRD record is 
largely derived from sea-ice transportation (as we show here), the 
timing of this floral turnover strongly suggests a two-phase establish- 
ment of middle Eocene Arctic sea ice (Fig. 3). 

We propose that the onset of IRD in the ACEX cores ~47.5 Myr 
ago marks the onset of shore-fast ice and/or frazil ice, forming up- 
current in one or more shallow areas of the broad marginal Arctic 
shelf, which entrained and (via wind-driven surface currents) trans- 
ported sand-sized terrigenous material to the central Arctic. Marginal 
sea-ice formation was likely to have been episodic, based on the low 
abundance of IRD (Fig. 3) and the absence of sea-ice diatoms in this 
phase (Supplementary Information). We further propose that the 
appearance of well-preserved, lamina-forming sea-ice diatoms and 
the concurrent disappearance of cosmopolitan diatoms ~47 Myr ago 
marks the onset of seasonally paced offshore sea-ice formation—for 
example, pack ice. It is conceivable that—under generally declining 
middle Eocene temperatures and atmospheric carbon dioxide (CO,) 
levels (refs 23, 24)—-sea ice would have formed initially at the mar- 
gins, later advancing into offshore areas of the Arctic Ocean, to 
include at least the surface-waters above, or close to, the palaeo- 
location of the ACEX drillsite (Fig. 1). A parallel increase in IRD 
(Fig. 3) indicates that ice-rafting of shallow shelf sediments intensi- 
fied at the same time. 

Before ~47 Myr ago, year-round shallow oceanic exchange 
between the Lomonosov ridge and the proto-Norwegian- 
Greenland Sea was possible, indicated by the presence of P. danica 
and Pt. aculeifera in coeval sediments from this area. Once an off- 
shore seasonal sea-ice regime was established, the northern limit of 
ice-intolerant cosmopolitan diatoms was driven southwards, and the 
potential for shallow oceanic exchange became restricted to the 
summer melt season. 

Our data provide compelling evidence for the establishment of an 
offshore winter sea-ice regime ~47 Myr ago that was stable enough to 
support an ecology and persistent enough to cause the southward 
migration of cosmopolitan taxa. Sea-ice-driven positive climate feed- 
backs, such as reduced oceanic heat loss to the atmosphere and 
enhanced albedo—by sea ice? (in spring) and cloud formation” (in 
summer)—would therefore have become particularly important at 
this time. A stable sea-ice regime also suggests the possibility of 
concomitant glacial ice. If glaciers were also present, then an under- 
standing of the relative contribution of both sea ice-IRD and iceberg- 
IRD is essential. 

A possible discriminator in the Arctic is quantitative analysis of the 
surface textures of ice-rafted quartz grains*®, using a statistical 
approach”. IRD grains exhibiting both sea-ice and iceberg textures 
are present in a short study interval (~236-242 m.c.d.; Fig. 4). 
Discriminant scores are significantly different for the two groups 
with 93% correctly classified, although some overlap exists between 
the populations (Supplementary Fig. 5). Our data indicate that sea ice 
was the dominant mode of ice transport between ~46.2 Myr and 
~46.0 Myr ago. Before ~46.15 Myr ago (240.53 m.c.d.), 80-100% 
of the grains exhibit sea-ice textures (0-20% iceberg-IRD textures). 
The highest input of iceberg-IRD occurs ~46.15 Myr ago, after which 
peaks in iceberg-IRD broadly correspond to peaks in total IRD 
abundance (Fig. 4). This suggests the presence of small isolated 
glaciers that pre-date estimates for those on Greenland (~38 Myr 
or ~44 Myr ago, refs 10 and 28 respectively), in keeping with results 
of a global climate/ice-sheet model’; this model shows the potential 
for small isolated ice caps on higher Arctic elevations in the Eocene 
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Figure 4 | Results of IRD analysis in a short core interval. a, SEM images of 
IRD with textural characteristics of sea-ice transport (upper row) and 
iceberg transport (lower row). b, Percentage of iceberg-rafted versus sea-ice- 
rafted quartz sand (>250 tum) and correlation with IRD abundance. Details 
are provided in Supplementary Information. 


and Oligocene, and no major glaciation until ~23 Myr ago or later’. 
Nevertheless, even where total IRD input is high, 60-80% of grains 
are classified as sea ice-IRD, confirming the importance of sea ice in 
the middle Eocene Arctic. 

The ACEX middle Eocene sea-ice record pre-dates known major 
Cenozoic glacial and sea-ice records, which has two intriguing con- 
sequences: (1) the current ‘icehouse’ cryosphere started to evolve 
earlier than previously thought; and (2) significant sea ice formed 
in the central Arctic some ~24 Myr before major ice-sheet expansion 
in the region’. Therefore, sea ice—ice sheet linkages, such as those 
surmised for Cenozoic Antarctica’, could not have existed in the 
middle Eocene Arctic. Furthermore, together with results of a 
numerical model for Cenozoic Antarctic climate evolution’, and 
known Cenozoic Antarctic sea-ice records (Supplementary 
Information), our data indicate that sea ice formed in the Arctic 
before it did in Antarctica. Under generally declining middle 
Eocene atmospheric CO (refs 23, 24), this implies that the threshold 
for sea-ice formation was crossed in the Arctic first, a hypothesis 
opposite to that modelled for glacial ice, whereby Antarctica is shown 
to glaciate much earlier (that is, at higher levels of CO) than circum- 
Arctic continents’. 

Arctic palaeogeography and palaeoenvironments may ultimately 
account for these consequences. Strong salinity stratification’®” pre- 
vailed within a basin essentially isolated from the world’s oceans 
(Fig. 1), where low surface salinities*”*° appreciably facilitated annual 
freezing. A notable influx of fresh water’? and subsequent fall in 
salinity starting ~47.6 Myr (refs 29, 30) or ~47.2 Myr (ref. 16) ago 
strongly suggests salinity-controlled preconditioning just before and 
during sea-ice initiation. 


METHODS SUMMARY 
Quantitative diatom analysis. Dried sediment samples from the interval 
~201-302 m.c.d. (cores 302 2A 47X to 4A 11X) were weighed to a value between 
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~0.1-0.2 g, processed for biogenic silica and analysed using the quantitative and 
light microscopy methodologies outlined elsewhere'®. Sampling resolution was 
at an average of ~20 cm over critical intervals (otherwise at every ~30-100 cm 
excluding coring gaps). Light microscope images were taken using a Panasonic 
DMC-FS5 camera at ISO 1600 and at <1,000 magnification. Further aliquots 
were dried onto aluminium scanning electron microscope (SEM) stubs, gold 
sputter coated and imaged using an LEO 1450VP SEM at 15-20 kV. Lamina- 
parallel fracture (topographic) surfaces of unprocessed core material were also 
imaged in the SEM. 

Quantitative IRD abundance. We followed previously outlined methods’. 
Quantitative IRD surface texture analysis. We analysed 160 grains; 10 quartz 
grains from the >250 jm size range were randomly picked for SEM imaging 
from each of 16 samples between ~236 and 242 m.c.d. (cores 302 2A 55X-56X). 
This interval was selected for its good core recovery’, good age control"*, robust 
IRD record’, and for encompassing the oldest recorded dropstone'® (‘isolated 
pebbles’ on Figs 3 and 4 and Supplementary Fig. 2). Samples were analysed using 
an LEO 1430VP SEM. Elemental (EDS) analysis verified that each grain was 
quartz. Statistical analysis included principal component analysis (PCA) and 
discriminant analysis”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Quantitative IRD abundance. Bulk sediment samples were freeze-dried and 
weighed. Samples were wet-sieved and dried at 60°C in an oven after which the 
sediment fractions were weighed. Visual estimates of the vol.% of the terrigenous 
abundance in the 150-250 and >250 im fractions were made using a binocular 
microscope. The terrigenous grains in these sand-size fractions are considered to be 
IRD. The wt% IRD is calculated by multiplying the wt% abundance of the sand 
fraction (150-250 tm, or >250 pm) by the visual estimate of the abundance of 
terrigenous grains in the sand fraction. 

Quantitative IRD surface texture analysis. A checklist approach”! was used, 
involving photographing and recording the presence/absence ofa series of features 


nature 


onevery grain. Checklist categories are a hybrid from refs 26 and 27. The number of 
variables (textures) was reduced to result in a data set with as many sea-ice textures 
as iceberg textures and the values for all variables were normalized to between 0 and 1. 
Based on three principal components, accounting for ~50% of the variance in the 
data set, individual grains were classified as either sea-ice or iceberg sourced, based on 
the classification of ref. 26. For the discriminant analysis, grains were assigned to 
groups according to the PCA results. 


31. Helland, P. E. & Diffendal, R. F. Jr. Probable glacial climatic conditions in source 
areas during deposition of parts of the Ash Hollow Formation, Ogallala Group 
(Late Tertiary), of Western Nebraska. Am. J. Sci. 293, 744-757 (1993). 
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Migration of the subtropical front as a modulator of 


glacial climate 


Edouard Bard’ & Rosalind E. M. Rickaby” 


Ice cores extracted from the Antarctic ice sheet suggest that glacial 
conditions, and the relationship between isotopically derived 
temperatures and atmospheric pco, have been constant over the 
last 800,000 years of the Late Pleistocene epoch’. But independent 
lines of evidence, such as the extent of Northern Hemisphere ice 
sheets’, sea level? and other temperature records‘, point towards a 
fluctuating severity of glacial periods, particularly during the more 
extreme glacial stadials centred around 340,000 and 420,000 years 
ago (marine isotope stages 10 and 12). Previously unidentified 
mechanisms therefore appear to have mediated the relationship 
between insolation, CO, and climate. Here we test whether north- 
ward migration of the subtropical front (STF) off the southeastern 
coast of South Africa acts as a gatekeeper for the Agulhas current>®, 
which controls the transport of heat and salt from the Indo-Pacific 
Ocean to the Atlantic Ocean. Using a new 800,000-year record of 
sea surface temperature and ocean productivity from ocean sedi- 
ment core MD962077, we demonstrate that during cold stadials 
(particularly marine isotope stages 10 and 12), productivity 
peaked and sea surface temperature was up to 6 °C cooler than 
modern temperatures. This suggests that during these cooler 
stadials, the STF moved northward by up to 7° latitude, nearly 
shutting off the Agulhas current. Our results, combined with 
faunal assemblages from the south Atlantic’* show that variable 
northwards migration of the Southern Hemisphere STF can 
modulate the severity of each glacial period by altering the 
strength of the Agulhas current carrying heat and salt to the 
Atlantic meridional overturning circulation. We show hence that 
the degree of northwards migration of the STF can partially 
decouple global climate from atmospheric partial pressure of 
carbon dioxide, pco,, and help to resolve the long-standing puzzle 
of differing glacial amplitudes within a consistent range of atmo- 
spheric pco,- 

The fluctuating severity of glacial periods (see Supplementary 
Information) challenges our understanding of the sensitivity of global 
climate to forcing by astronomical parameters, and pco,. We inves- 
tigate the migration of the STF in the southern Indian Ocean over 
multiple glacial cycles to explore whether the latitude of the STF acts as 
an insolation-sensitive feedback in the climate system, and amplifies 
low-latitude temperature anomalies to global climate phenomena. 
The latitude of the STF is determined by the strength and location 
of atmospheric winds. The subtropical convergence, characterized by 
negative wind stress curl in the atmosphere and associated Ekman 
transport convergence in the ocean, denotes the region between the 
core of the trade winds and the maximum southwesterly winds. 
Embedded in the subtropical convergence and towards its poleward 
side is the STF, a zonal band of enhanced productivity and weak (2 °C) 
meridional sea surface temperature (SST) gradients. The STF defines 
the southern limit of the subtropical gyres and separates them from 


the broad eastward flow of the Antarctic circumpolar current (ACC) 
further south. 

We reconstruct the latitude of the STF from hydrographic changes at 
core MD962077, currently situated under the Agulhas current of the 
subtropical gyre of the Indian Ocean over the last 800 thousand years 
(kyr) (see online-only Methods). We test whether northward migration 
of the STF relative to South Africa acts as a gatekeeper for the leakage of 
heat and salt between the Atlantic and Indo-Pacific oceans. The Agulhas 
valve comprises surface and intermediate water transport from the 
Indian Ocean into the Atlantic. This takes place through the detachment 
of rings, eddies and filaments from the Agulhas retroflection, consisting 
of warm and salty Indian Ocean waters and providing a key return route 
for the supply of waters to the Atlantic meridional overturning circula- 
tion (AMOC)*®. Core MD962077 is situated ~7° latitude equatorwards 
of the modern STF (33.17 °S, 31.25 °E, 3,781 m, Fig. 1), and provides a 
sensitive monitor of the meridional movements of the STF, which 
marks the southern boundary of the Agulhas return current. 

Our 800-kyr records of SST (U*37', and 8'%0 of Globigerina inflata 
and coccolithophore fraction; Fig. 2a) and biological productivity 
(abundance of authigenic uranium Uautn, total organic carbon 
TOC and total C37 alkenones C37;,.;3 Fig. 2b, data available in 
Supplementary Information) indicate that each 100-kyr glacial cycle 
is marked by cooler temperatures and higher productivity, indicating 
a northerly migration of the STF more proximal to core MD962077, 
and displaying the same distinctive pattern of a secular trend in the 
glacial minima. The glacial northward shift of the STF agrees with 
records from foraminiferal and coccolithophore assemblages in this 
region’ "', which highlight that the reappearance and abundance, in 
the south Atlantic, of the ‘Agulhas leakage fauna’ is due to a deglacial 
strengthening of the Agulhas connection. 

The marine isotope stages MIS-12 and MIS-10 stand out as excep- 
tionally cool and productive glacial periods. Alkenone-derived SSTs 
record a subtropical Indian Ocean 4°C cooler than the Holocene 
during MIS-2, but MIS-12 and MIS-10 were an additional 2 °C colder. 
This secular trend in SST is matched by the G. inflata 8'%O value (the 
MIS-12 5'%O is heavier than that of MIS-2 by 0.5%, ~2°C) and 
coccolithophore fraction 5'*O (MIS-12 is heavier than MIS-2 by 
1.5 %o but natural effects amplify the coccolithophore fraction 5'°O; 
see online-only Methods). All three productivity proxies imply 
enhanced productivity during MIS-12 and 10 compared to other gla- 
cials. Individually, these proxies cannot be scaled linearly to produc- 
tivity but are highly sensitive to productivity changes (see online-only 
Methods). However, the coherence in signal between three SST and 
productivity proxies, recorded in different phytoplankton and different 
phases of the sediment susceptible to a range of biases, lends confidence 
that our secular trend in temperature and productivity is robust. 

A picture emerges that the STF migrated northwards towards core 
MD962077 during each glacial period. During MIS-12 and MIS-10, 
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Figure 1| SeaWiFS (Sea-viewing Wide Field-of-view Sensor) image of 

ocean colour during austral summer with the highly productive STF to the 
south of core MD962077 (red star). The core is at present in the path of the 
Agulhas current, and is influenced by the southward flow of warm and saline 


the STF advanced by at least 7° latitude to reach the lowest latitude 
(33 °S) of the past 800 kyr, as marked by the additional 2 °C cooling 
that is characteristic of the SST gradient of the modern STF. The STF 
impinged upon the southern African continent in the Indian Ocean 
during these extreme glacial periods and almost completely shut down 
the Agulhas leakage*’°"', with a tiny residual, perhaps intermittent, 
flow to account for the rare occurrence of the Agulhas leakage fauna in 
cores from the Atlantic sector''’*. An associated migration north- 
wards of the southwesterly winds by 5-9° latitude during glacial 
periods is corroborated by a number of proxy records’’. This shift 
and change in intensity of the winds is difficult to capture consistently 
in intermediate-complexity models'* and coupled global climate 
models’>. 

We propose that a northerly migration of the STF relative to South 
Africa acts as a choke point in the climate system. The latitude of the 
STF has the potential to influence both surface buoyancy fluxes of heat 
and salt that set the strength and shape of the AMOC” and also mixing 
and wind-driven upwellings that input energy to sustain the AMOC. 

As a buoyancy driver of the AMOC, the transport flux of heat and 
salt in surface waters between the major ocean basins is crucial to 
return water, salt and heat to the northward flow of the AMOC’”"®. 
The exchange of surface water occurs along a ‘warm’ route and a ‘cold’ 
route; the balance between these routes may alter as oceanographic 
fronts migrate relative to the tips of continents in the Southern 
Ocean'’. Important topographic valves restrict the water flow in the 
Indonesian archipelago, Drake passage, Bering strait and the Agulhas 
valve of the southern tip of Africa’. These valves were drastically 
different during the 100-150 m sea-level drop of glaciations and could 
control ocean currents through threshold effects. Our inferred most 


waters that are ultimately transported from the Indian to the Atlantic oceans 
today. The core site is highly sensitive to meridional movements of the fronts 
in this critical region. LGM-WSI, Last Glacial Maximum winter sea ice 
extent. 


northerly migration of the STF to 33 °S during MIS-12 and MIS-10 
severely reduced the warm saline influence of Indian Ocean thermo- 
cline water and the Atlantic salinity budget owing to the Agulhas 
leakage*!*°°. This effect causes cooling and freshening of the 
Atlantic and implies maximal reduction in North Atlantic deep water 
formation during MIS-12 and MIS-10. 

Energy input to maintain the AMOC derives, in part, from the 
coupling between strong circumpolar southwesterly winds, which 
induce a vigorous ACC, upwelling of dense waters from the ocean 
interior, and northward Ekman transport of waters near the ocean 
surface” as well as southward eddies. Today, the frontal system and 
associated southwesterly winds cross the continent of South America, 
which diverts the ACC flow southwards through the Drake passage 
(56°S). At this point, the ACC is decoupled from the winds and 
frontal systems that transfer energy to the current*’. For the rest of 
the ACC pathway in the modern ocean, the westerly winds align with 
the ACC. We infer that either the core or meridional extent of the 
wind field of the southwesterly winds spread to their most northerly 
position during MIS-12 and MIS-10, and encountered two further 
continental barriers (Australia at 39 °S, and South Africa at 35 °S), 
decoupling the topographically constrained ACC to the south from a 
larger fraction of the frontal-associated winds and reducing the 
energy transfer to the ocean’. These additional circuit breaks in 
energy transfer from the southwesterly winds to the ACC would 
amplify a decrease in deep ventilation, on top of a severely reduced 
Agulhas leakage, during MIS-12 and MIS-10. 

To test whether northwards migration of the STF is communicated 
globally via an influence on AMOC, we consider our data in the 
context of a record of the 8'°C contrast between deep waters in the 
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Figure 2 | Long-term trends in SST and productivity-sensitive parameters 
from core MD962077 compared to the stable baseline glacial 
temperatures recorded in EPICA Dome C. a, Coccolith fraction 80 (CF, 
red dots, left axis), G. inflata sO (planktonic, green dots, left axis), and 
UX,,' (alkenone, blue dots, right axis) highlight the evolution of subtropical 
region glacial SSTs, compared to the stable baseline temperature estimates 
(black line, the relative abundance of deuterium, 5D, scaled according to 
(T + 34)/2 to fit the left axis) from glacial periods in Antarctica recorded in 
EPICA. The black arrow labelled ‘B/M’ shows the position of the Brunhes/ 
Matuyama in the deeper section of core MD962077. b, Long-term trends of 
productivity proxies—U,,,, in parts per million, p.p.m. (blue open circles), 
percentage TOC (black open circles) and C376, (red filled circles) —which 
show high productivity during each glacial period indicative of a northwards 
migration of the STF. This migration evolves, the STF being at its most 
northerly position during MIS-12 and MIS-10. (c.f.b., carbonate-free basis; 
see online-only Methods.) 


Atlantic and Pacific oceans, an indicator of the efficiency of deep- 
ocean ventilation (Fig. 3, Supplementary Information). Our secular 
trends in temperature and frontal location are accompanied by a 
glacial decreased contrast between the Pacific and Atlantic deep water 
masses (A3!°C), with near-zero Ad'°C during MIS-12 and MIS-10; a 
picture also obtained from the 8'°C vertical gradient in the Southern 
Ocean”. This lack of 5'°C gradient could indicate drastically reduced 
ventilation of the Atlantic, or enhanced ventilation in the Pacific. 
Both changes could have occurred with a similar ventilation contri- 
bution from the Southern Ocean. Recent numerical simulations of a 
latitudinal shift of southwesterly winds support a scenario with a 
decrease of Agulhas leakage by a factor of seven, and a substantial 
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Figure 3 | Mechanistic link between the migration of the STF, orbital 
eccentricity and ocean overturning. The position of the STF, denoted by our 
record of SST (black, open circles) co-evolved with orbital eccentricity (grey 
dashed line) and a measure of overturning strength (black line) based on the 
51°C gradient between the Atlantic (Ocean Drilling Program ODP site 607) 
and the Pacific (ODP site 846, A5'°C) oceans. This provides an independent 
measure of the ventilation strength*' of the AMOC over the last 800 kyr. We 
derive the AS'°C value from the difference between the 5'°C values of ODP 
Site 607 and ODP Site 847. 


decrease of North Atlantic deep water transport, with a concomitant 
increase and deepening of North Pacific convection”. These effects 
result from the wind shift and the enhanced build-up of Pacific 
surface salinity. 

The decrease in ventilation by the AMOC leads to reduced polar 
heat transport and amplifies the growth of northern ice sheets, and 
also leads to lowest sea levels during MIS-12 and MIS-10. These 
frontal choke points of the STF impinging on Australia and South 
Africa transfer the tropical cooling of MIS-12 and MIS-10 to a global 
climate signal. 

The position of the STF during MIS-12 and 10 could be a response 
to cooler global climates. Alternatively, a more regional forcing of 
anomalously cool equatorial temperatures could modulate the posi- 
tion of the STF. 

Global cooling drives an equatorward migration of the STF. A 
global cooling of 3 °C achieves a 7° equatorward migration of the 
fronts and the peak westerly wind stress position, according to an 
idealized model”. The residual between the EPICA (European 
Project for Ice-Coring in Antarctica) Dome C temperature record 
and the temperature component that is linearly linked with green- 
house gas radiative forcing shows a long-term trend of decreasing 
temperatures towards 400 kyr and increasing towards modern times, 
in parallel with the longer-term trend of the STF position. However, 
the minimum levels of pco, during MIS-12 and MIS-10 are not 
exceptionally low compared to other glacial periods'. Although 
increased global coccolithophore production” implies a quasi-400- 
kyr change in the global carbon cycle, Pleistocene trends in pco, are 
compensated biogeochemically” or masked by higher-frequency 
cycles in the record. Exceptional glacial conditions during MIS-12 
and MIS-10 require an alternative trigger and amplifier. 

Insolation could induce anomalous tropical SSTs that determine 
the strength of Hadley circulation and position of the STF. Our SST 
and productivity records show a strong signal at the 100-kyr and 
400-kyr frequencies of eccentricity (Fig. 3). Eccentricity modulation 
of precession may lead to anomalous tropical SSTs owing to a 
truncated response”, or to the double maximum of daily irradiation 
characteristic of the tropics’’** rather than the high latitudes. 
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Forcing at 100-kyr and 400-kyr frequencies is amplified, relative to 
half-precession, by increasing the fraction of the year over which the 
maximum and minimum equatorial insolation is time-averaged”*. 
Such time-averaging may best approximate the thermal inertia of the 
tropical SST response to maximum equatorial insolation. 
Anomalously cool tropical SSTs, caused by low maximum equatorial 
insolation (modern and about 400 kyr ago), could reduce the Hadley 
cell (the tropical atmospheric convection cell driven by equatorial 
heating), draw the STF northwards* and act as a trigger for the 
proposed AMOC positive feedback for exceptional global cooling 
during MIS-12 and MIS-10. 

An implication of the amplification of cooling during MIS-12 and 
MIS-10 is that there must be some decoupling between Antarctic 
climate, global climate and pco, (ref. 1). The central Antarctic ice 
sheet may be decoupled from global climatic shifts either by the slow 
transmission of signals across a strengthened glacial ACC” or by 
the enlarged extent of sea ice during the glacial periods insulating 
the Antarctic continent from changes in the ocean, analogous to the 
buffering of Greenland during the extreme cold of Heinrich events by 
sea ice and the southward migration of the deep ventilation site”. 

It appears that pco, can be decoupled from climate at glacial 
extremes such that insolation modulates the maxima and minima 
of climate. Modulation of equatorial insolation provides a way of 
amplifying eccentricity forcing into the climate cycles of Pleistocene 
glaciations. A proposed mechanism of links between topographic 
barriers, the STF and the AMOC may explain the puzzling fact that 
most palaeo-records indicate that glacial periods have different ampli- 
tudes. Understanding the effect of the Agulhas valve on ocean circula- 
tion is important for modern and future climate studies, as illustrated 
by recent work on decadal variations of the southwesterly winds®. 


METHODS SUMMARY 


The chronology of core MD962077 relies on the last three geomagnetic reversals 
and tuning of the foraminiferal 5'°O to a global stack. Alkenones were analysed at 
CEREGE by gas chromatography using an automated Dionex accelerated solvent 
extractor (ASE-200) to perform lipid extraction. Core MD962077 was sampled 
every 10-20cm in plastic vials to measure U, Th and other trace elements by 
inductively coupled plasma mass spectrometry (ICP-MS). We measured 5'%O, in 
parts per thousand relative to the Vienna Pee Dee Belemnite standard (%0 VPDB) 
on the multispecies coccolithophore fraction, with minimal contamination from 
other carbonate components such as foraminiferal fragments. The foraminiferal 
5'80 record was previously published’. Given the limited constraints on long- 
term whole-ocean changes in 8/80, and salinity at core MD962077, we have not 
deconvolved a record of SST from planktonic 3!8O. Nonetheless, MIS-12 8'%0 is 
heavier than MIS-6 by 0.5%0, which is equivalent to a maximum estimate of 
additional cooling of ~2°C. The coccolith fraction 8'%O records the largest 
isotopic change: MIS-12 is heavier than MIS-2 by 1.5%, but changes in the 
coccolithophore assemblage occur on these timescales in core MD962077”° and 
large interspecific fractionation factors of up to 4%o can account for a fraction of 
this change. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


The chronology of core MD962077 relies on the last three geomagnetic reversals 
and tuning the foraminiferal 5'8O to the global stack”. Alkenones were analysed 
at CEREGE by gas chromatography following the method of ref. 33 except that 
an automated Dionex accelerated solvent extractor (ASE-200) was used to 
perform lipid extraction**. Proper identification and quantification of alkenones 
was verified by gas chromatography mass spectrometry. The analytical precision 
of the method is about 0.01 units for US;7’ (roughly equivalent to 0.3 °C) based 
on repeated extraction of internal laboratory sediment standard during the 
course of this study. The accuracy of our procedure has been checked within 
the framework of the international alkenone intercomparison*’. For simplicity 
and to allow comparison with other palaeo-data sets, the SST values were calcu- 
lated using the equation of ref. 36 which is equivalent to that derived from the 
core-top compilation*’. Culture studies show that there is no systematic offset 
between SST relationships of Gephyrocapsa oceanica and Emiliania huxleyi’. 
We measured 5'°O on the multispecies coccolithophore fraction, with 
minimal contamination from other carbonate components such as foraminiferal 
fragments. The methodology for separating the coccolithophore fraction was 
based on that of ref. 39. All plasticware was acid-cleaned before use, and 
deionized water used throughout. 0.5 g of sediment was sieved using 90% proof 
ethanol through a <20 um sieve. The collected slurry was poured into 15 ml 
centrifuge tubes and allowed to settle for 10 min. The supernatant containing the 
suspended particles was pipetted into a second tube and allowed to settle for a 
further 24h. The supernatant, considered to contain the major clay fraction, was 
pipetted away and filtered for recycling. The remaining <10 um fraction was 
dried and a subsample taken and rinsed three times with deionized water to 
ensure elimination of any relict ethanol. A small fraction was weighed into boats 
for analysis by stable isotope mass spectrometry (PRISM at Harvard University). 
The foraminiferal 5'°O record was previously published’: for each analysis, 60 
specimens of Globorotalia inflata were picked from the 250-350 um fraction and 
analysed at the National Taiwan Ocean University in Keelung using standard 
procedures. Given the limited constraints on long-term whole-ocean changes in 
8/80, and salinity at core MD962077, we have not deconvolved a record of SST 
from planktonic 8/80. Nonetheless, MIS-12 5'80 is heavier than MIS-6 by 0.5%o, 
equivalent to a maximum estimate of additional cooling of ~2 °C. The coccolith 
fraction 8'°O records the largest isotopic change: MIS-12 is heavier than MIS-2 
by 1.5%, but changes in the coccolithophore assemblage occur on these time- 
scales in this core” and large interspecific fractionation factors of up to 4%o can 
account for a fraction of this change’. Measuring three thermometric proxies 
such as U3,’ of alkenones and 8!%O of both foraminifera and coccoliths is 
important because these independent proxies provide confirmation of temper- 
ature trends. Indeed, it is crucial to be able to rule out possible artefacts related to 
species, seasonality or habitat depth of signal markers and of post-deposition 
processes such as sediment mixing. Differential smoothing or lagging can occur 
in sediments between different size fractions of the same core*'. In core 
MD962077, the 5!80 of coccoliths is measured in the same fine fraction as 
alkenones, while the 8'°O of foraminifera belong to the coarse fraction. When 
similar trends are observed for the three proxies (as is the case), it is possible to 
rule out such an artefact as an alternative explanation to temperature changes. 
Core MD962077 was sampled every 10-20 cm in plastic vials to measure U, Th 
and other trace elements by ICP-MS. Procedures are similar to those used in our 
previous work’ except that ICP-MS measurements were performed with a 
Perkin-Elmer Elan 6000 at Service d’Analyse des Roches et des Mineraux 
(SARM) of the CNRS in Nancy, France. For both elements, the uncertainty is 
<5% and the determination level is 0.01 p.p.m. Uauin concentrations were 
calculated by subtracting a crustal component determined from the measured 
Th concentration and a detrital Th/U weight ratio of 4.71 (ref. 43). The detrital 
U correction is relatively small (about 1 p.p.m. on average) in comparison with the 
observed U variations. Moreover, U and Uautn profiles are roughly parallel and the 
two records exhibit the same variability. Therefore, using either the U or Uauth 
profile would not change the interpretation of our results. All concentrations 
shown in Fig. 2b were converted to concentrations on a carbonate-free basis 
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(c.f.b.), which is the conventional procedure to account for the effect of dilution 
by carbonate (see ref. 42 for example). The Usutnh, TOC and C37;o concentration 
profiles with and without this correction are roughly parallel and using raw con- 
centrations would lead to the same interpretation. 

TOC, C37tor and Uautn concentrations have been used as proxies for the bio- 
logical productivity in the Southern Ocean related to migrations of the STF and 
the subantarctic front***’. Following the work of these authors and from our own 
studies in other regions**“* it is clear that these proxies cannot be scaled linearly 
to productivity. Indeed, the organic carbon raining from the surface is partly 
remineralized in the sediments, which creates more-reducing conditions, lead- 
ing to U diffusion in pore waters and trapping in the sediments“. These reducing 
conditions also lead to preferential preservation of lipids such as alkenones, 
which always exhibit a much larger dynamic range than observed with TOC”. 
Trace metals can also be remobilized by additional processes such as oxic front 
diffusion and bioturbation”’. All these natural processes are superimposed on 
each other, making these markers only qualitative, although very sensitive, 
proxies for the surface productivity. Despite these complications, the strength 
of our approach is that we consider all three proxies measured independently. 
When TOC, C37o¢ and Usuth trends agree, indicating enhanced values, it is 
reasonable to assume that the observed maxima are associated with a significant 
productivity increase linked to the STF migration. MIS-12 and MIS-10 are 
indeed characterized by maxima of TOC, C37¢4, and Unuth- 
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Global patterns of speciation and diversity 


M. A. M. de Aguiar’”, M. Baranger’”, E. M. Baptestini’, L. Kaufman’* & Y. Bar-Yam’ 


In recent years, strikingly consistent patterns of biodiversity have 
been identified over space, time, organism type and geographical 
region’”. A neutral theory (assuming no environmental selection 
or organismal interactions) has been shown to predict many 
patterns of ecological biodiversity*’. This theory is based on a 
mechanism by which new species arise similarly to point muta- 
tions in a population without sexual reproduction. Here we report 
the simulation of populations with sexual reproduction, mutation 
and dispersal. We found simulated time dependence of speciation 
rates, species—area relationships and species abundance distribu- 
tions consistent with the behaviours found in nature’ ’. From our 
results, we predict steady speciation rates, more species in one- 
dimensional environments than two-dimensional environments, 
three scaling regimes of species—area relationships and lognormal 
distributions of species abundance with an excess of rare species 
and a tail that may be approximated by Fisher’s logarithmic series. 
These are consistent with dependences reported for, among 
others, global birds* and flowering plants’, marine invertebrate 
fossils®, ray-finned fishes’, British birds*? and moths’, North 
American songbirds'', mammal fossils from Kansas’? and 
Panamanian shrubs’*. Quantitative comparisons of specific cases 
are remarkably successful. Our biodiversity results provide 
additional evidence that species diversity arises without specific 
physical barriers®''"*. This is similar to heavy traffic flows, where 
traffic jams can form even without accidents or barriers’’. 

Speciation studies have identified conditions under which 
speciation events can occur. Allopatry is considered the dominant 
form of speciation. Studies of partial barriers (parapatric speciation) 
and debates about sympatric speciation focus on whether such types 
of speciation are possible and, if so, whether they are plausible in 
nature’®'’, Simulations of homogeneous spatial environments have 
suggested the necessity of trait-based variation of sexual selection, 
competition or diffusion rate, or use outbreeding depression’*”. 
Studies exist indicating these may not be necessary even in sympatry, 
at least where there is no genetic linkage*'**. Significantly for the 
results we report here, no simulations of speciation that include 
sexual mating have yet been shown to yield the observed patterns 
of species diversity. 

We simulated the evolution of a population whose members, at 
the beginning, are uniformly distributed in space and have identical 
genomes. The population evolves under the combined influences of 
sexual reproduction, mutations and dispersal. During reproduction, 
potential mates are identified from among those in a spatial region 
around an individual (specified by a spatial mating distance, S) whose 
genomes are sufficiently similar to that of the individual (specified by a 
genetic mating distance, G). This is a minimal form of sexual selection, 
essential (necessary but not sufficient) for speciation, called assor- 
tative mating” (postzygotic genetic incompatibilities may have a role 
but are not essential’). A mate is chosen from this set at random. 
Reproduction with crossover and mutation occurs. An offspring is 
then dispersed within a region around the originating and expiring 


parent. Genetic variation grows over time, due to mutation and 
recombination. We identify a species as a group of organisms repro- 
ductively separated from all others by the genetic restriction on mating 
and connected among themselves by the same condition, without 
requiring all members of the group to be able to mate with each other. 
An example of such a case is a ring species’, in which progressive 
differences along a chain of individuals result in individuals at one end 
not being able to mate with those at the other end. We consider this a 
single species, owing to the ongoing gene flow. No condition on spatial 
proximity is imposed on the members of a species. 

We simulated many variants of the model; the one we report here 
uses haploid and hermaphroditic individuals placed randomly in the 
space (for convenience annotated as a 128 X 128 or 256 X 256 lattice). 
Multiple individuals can exist at the same site but typically do not, 
owing to low densities. Unlike parapatric simulations that are 
otherwise similar***°, our density of organisms is less than one per site 
(there are no a-priori-defined groups whose genetic divergence is a 
possible mechanism of speciation), so our speciation may be termed 
‘topopatric’. Genomes consist of binary strings of length 125. Variation 
in the number of offspring of an individual was included by randomly 
choosing a neighbour to reproduce, instead of the expiring parent, in 
Q= 30% of cases. Other assumptions, including variations of para- 
meters or more basic changes such as separating two sexes, give qua- 
litatively similar results (Methods). 

Depending on the model parameters, we find sexual isolation of 
subpopulations in genome space (speciation) and spatial inhomo- 
geneity in spite of the absence of geographical barriers, resource 
gradients or natural selection. We show an example in Fig. 1. The 
first branching into two species occurs after about three hundred 
generations; additional speciation events take place later. Each of 
the species is coloured differently. Species extinctions also occur, 
owing to random variation in population size and resulting in a 
variable number of species over time. Although species are not 
homogeneously distributed and tend to be concentrated in different 
regions, there is considerable overlap among their domains. Newly 
branched species are genetically close to their parent species, whereas 
species formed in the far past reach the maximum average genetic 
distance in the genome space. The species that form are completely 
separated by genetic difference according to the mating condition. 
The spatial and genetic clustering occurs even though the underlying 
mating behaviour couples organisms in both physical and genetic 
space. 

In Fig. 2, we show the effect on the pattern of speciation of varying 
the spatial, S, and genetic, G, mating distances. Speciation becomes 
less likely as the critical mating distances increase. Neither spatial nor 
genetic restrictions on mating alone leads to speciation. The former is 
required for the distinction of spatial location, and the latter is 
required for genetic isolation. The largest number of species arises 
for small values of both parameters. However, for intermediate values 
of spatial distance (for example S = 6 in Fig. 2), intermediate values 
of genetic distance yield the most species, as small values of the 


'New England Complex Systems Institute, Cambridge, Massachusetts 02138, USA. *Universidade Estadual de Campinas, Unicamp, Campinas, Sao Paulo 13083-970, Brazil. 
University of Arizona, Tucson, Arizona 85719, USA. “Boston University, Boston, Massachusetts 02215, USA. 


384 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE| Vol 460|16 July 2009 


T = 450 


LETTERS 


T = 500 T= 550 


Figure 1| Time evolution for 2,000 individuals on a 128 x 128 lattice. Maximum mating distance between two organisms is S = 6 lattice cells, and G = 20 
genetic differences out of 125 total genes. Reproductively isolated species are shown in different colours. 


genetic distance reduce the probability of genetic fluctuations arising 
from mutation and recombination that lead to speciation. 

The average spatial density of a species over a population 
(Supplementary Fig. 2a, b) is consistently well described by a 
Gaussian 


p(r)=poe 

where r is the distance from the geographic centre, ~9 is approxi- 
mately the average density, N/L’ (where N is the number of organ- 
isms and L’ is the area), and, in the regime of large numbers of 
species, R is given by 


R=a ce + BS 
m 
with « = 0.18 and f = 0.26. By integration, the number of species is 
Ns =N/np oR’. 
The model describes the spatial dynamics of speciation, from 
which we can obtain the time dependence, the spatial dependence 
(species—area relationship) and the distribution of species sizes (species 


Figure 2 | Spatial snapshots after 1,000 generations for 2,000 individuals 
0na128 x 128 lattice. S and G areas shown. Colours are for different species. 


abundance distribution). In Fig. 3 (see also Supplementary Fig. 2c, d), 
we summarize the following four key predictions for species diversity 
consistent with observation. 

First, we predict constant rates of new species over long times, as 
captured by the fossil and genetic data. Figure 3a shows how the 
number of species varies over time. After a transient period, during 
which mutation and recombination increase the variation in the 
initially identical population, the number of species increases rapidly 
and reaches a steady state subject to statistical fluctuations. The exist- 
ence of a steady state is sufficient to imply a constant rate of new 
species arising over long times. This result is consistent with the 
record of marine invertebrate fossils from the Phanerozoic eon’. 
Over shorter time frames, genetic differences among North 
American songbirds"! and the fossils of mammals from the Meade 
basin of Kansas’* show remarkably steady speciation rates (consistent 
with our results) and do not correlate with glaciations, disproving 
this once-dominant view, which was held because of the expectation 
that speciation is promoted by physical barriers. A constant spe- 
ciation rate in the absence of landscape forcing is generally supported 
by the literature on whole-tree cladistic analysis**. This does not 
discount the potential importance of habitat expansion, such as 
occurred for North American organisms following glacial retreat. 

Second, we predict a high diversity of freshwater versus marine ray- 
finned fishes when measured by area. Figure 3b shows that the number 
of species formed increases with the available area if the density of 
individuals (and corresponding values of other parameters) is kept 
constant. In one-dimensional environments, the number of species 
formed is much greater per unit area. Thus, the lower dimensionality 
of rivers relative to oceans has an enhanced effect on speciation even 
when specific barriers do not exist. We studied one-dimensional and 
two-dimensional cases, and river networks may be considered fractal 
with dimension between one and two, but the same considerations 
apply. This is consistent with the high diversity of freshwater ray- 
finned fishes when measured by area and similarly the preponderance 
of near-shore shallow-water marine species’. 

Third, we predict a species—area relationship that is largely inde- 
pendent of the type of species or geographical location, is nearly 
linear on large and small scales and has intermediate scaling beha- 
viour with a low slope that varies among ecological communities’. 
The species—area relationship within a system is shown in Fig. 3c ona 
log-log plot. The simulated results correspond very well to those 
found, among others, in birds’ and flowering plants’ sampled on 
local to global scales, as well as exhaustive studies of trees and shrubs 
in Panama’? (detailed fit, as shown). On intermediate scales, the slope 
can be varied with model parameters (inset, Fig. 3c). 

Finally, we predict the distribution of species abundance to be well 
fit by a lognormal distribution with excess rare species. Simulated 
species abundance distributions (detailed correspondence shown in 
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Figure 3 | Species abundance. a, Simulated number of species as a function 
of time (S = 5, G = 20, also used for b, c, d). Blue fits are a guide to the eye for 
the increase (solid) and the steady state (dashed). Time measured in 
generations. b, Simulated number of species for increasing system size in one 
(1D) and two (2D) dimensions (P < 0.0001). Area measured in lattice units. 
Error bars, s.e.m. ¢, Empirical species—area relationship from Panamanian 
trees'’ (red; upper and right-hand axes) compared with simulation (black; 
lower and left-hand axes) of samples with lattice sizes of up to 128 X 128, 
taken from a single simulation of 8,000 individuals on a 256 X 256 lattice 
(ranges shown are standard deviations across multiple simulations) fitted by 
anchoring lowest and highest points (R? = 0.88). Inset shows smallest slope 
of simulations as a function of Q (error bars, s.e.m.). d, Empirical species 
abundance distribution of Panamanian trees’ (red; upper and right-hand 


Fig. 3d, e) fit observations of Panamanian trees’ and British birds*”, 
and are of the form found for moths'® and various studies of trees”, 
among others’. As the number of samples decreases relative to the 
number of species, the right-hand tail of the distribution, consisting 
of common species, dominates the sampling, leading to a distri- 
bution that can be fit both by the tail of the lognormal and by 
Fisher’s logarithmic series” (Fig. 3f), as found among many observa- 
tions'* (Supplementary Fig. 1). Our species abundance rank distri- 
butions also display the observed S-shaped curve’. 

The key success of neutral theory with a point-mutation model of 
species formation”’ has been its ability to describe the forms of 
species—area relationships and species abundance distributions 
found among many species. Figure 3c—f shows that our model yields 
similar results. These results are found even though our mechanism 
includes not only the appearance of localized species, but also the 
splitting (fission) of a larger species into smaller populations. This 
does not correspond to allopatric speciation, owing to the absence of 
geographic barriers. Such splitting leads to key differences from the 
point-mutation model. For example, the rate at which the steady 
state arises is much faster because each new species need not start 
from a single individual. Moreover, the formation of species is not 
independent of genealogical history. Species fission might be 
expected to significantly change the behaviour of the species abund- 
ance distributions, but we nevertheless find a lognormal distribution 
with excess rare species and detailed agreement with observed species 
abundance and species—area relationships. 

We note that there are also regimes of the model in which spe- 
ciation is quite difficult. This accounts for the existence of some 
observations where geographic barriers and selection do have an 
important role; for example, observations of greenish warblers"™ have 
shown the direct impact of geographic barriers and genetic variation 
across continent-wide environmental gradients. 

We have shown that our distributions of species reach a steady 
state rapidly, with extinctions balanced by speciation consistent with 
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axes) compared with simulation (black, with error bars; lower and left-hand 
axes; a = 3.1, P= 0.01) and a lognormal distribution (blue; va = 3.9, 

P= 0.08). Inset shows a lognormal distribution fitted only to highest and 
rightmost simulation points. Error bars, s.e.m. e, As in d (lognormal only in 
inset, and dashed red line added for clarity), but for British birds (S = 6, 

G= 10, N= 8,000, Q= 0.5) (7 5.5, P< 0.0001). The distribution is 
consistent with a larger effective mating distance, although other parameters 
may affect this conclusion. Even migratory birds satisfy the model 
assumptions of mating proximate to their locations of birth*’. Error bars, 
s.e.m. f, Simulated species abundance for sampling over a 64 X 64 subset ofa 
256 X 256 lattice (S = 6, G = 10, N = 8,000; Supplementary Fig. 1), fitted to 
two essentially overlapping curves: lognormal (blue; 7” = 0.009, P < 0.0001) 
and log-series (cyan; 7 = 0.005, P< 0.0001). Error bars, s.e.m. 


observations, and that species—area relationships and species abund- 
ance distributions are consistent with those frequently found in 
nature. The general forms of the distributions are as observed, and 
specific simulations are in remarkable correspondence with specific 
observations. Underlying the agreement is a simple model of a 
sexually reproducing population in a spatial context with mating 
restrictions depending on genetic and spatial distance—these are the 
essential properties for the dynamic biodiversity patterns. Our model 
dynamically subdivides spatial regions, which may be called topopatric 
speciation. The spontaneous creation of new spatial patterns is an 
example of the general process of spontaneous pattern formation 
and symmetry breaking found in many systems”, including traffic 
jams’. Pre-existing geographic features and variation may influence 
patterns, but are not necessary for their formation. The relationship 
between this process and niche-ordering models” could be explored. 


METHODS SUMMARY 


We used agent-based simulations in which agents identified by geographical 
location and genotype undergo sexual reproduction in pairs limited by geo- 
graphical and genetic proximity. Each offspring replaces a parent and differs 
in genotype according to genetic inheritance from both parents, with crossover 
and mutation, and in geographical location by dispersal. According to tests of 
multiple model variants, including parameter variation, sequential and syn- 
chronous mating, hermaphroditism and two sexes, single or multiple crossovers, 
direct assortative mating and separate fitness and sexual selection genetic com- 
ponents, our results apply quite generally, with the key properties needed being 
the limitations on spatial and genetic distances of mating. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


For the simulations reported in the paper, we considered N haploid and 
hermaphroditic individuals placed randomly on a lattice (128 X 128 or 
256 X 256) where multiple individuals can exist at the same site. Genomes consist 
of binary strings of length B, and the genetic distance between two individuals is 
the number of differences along the genome. Reproduction starts with an indi- 
vidual looking for a mate inside its spatial mating radius, S. To qualify, an indi- 
vidual must have a genetic distance no greater than G from the seeker, reflecting 
differences in traits or offspring viability. From all such potential mates, we select 
one at random. If there are fewer than a minimum number of possible mates (an 
additional model parameter, P), we weaken the spatial and genetic constraints as 
follows for that mating: S> $+ 1 and G> G+ 1. If the number of available 
mates is still smaller than P, the process is repeated. (We also investigated letting 
S— $+ 1 without changing G and, if there were not enough mates in the entire 
space, choosing a neighbour to reproduce instead, with similar results.) 

Each individual is a seeker once in a generation, but there is a probability, Q, that 
it will not reproduce at all. When that happened, another individual is randomly 
selected from its spatial neighbourhood to reproduce in its place, with the offspring 
placed in the location of the original seeker. This maintains an approximately 
homogeneous demographic density, corresponding to a fixed local carrying capa- 
city. Each reproduction occurs with one genetic crossover (the genome up to a 
certain point is from one parent and after that is from the other parent), reflecting 


nature 


genetic linkage. Mutation occurs with a probability yu. Offspring dispersal occurs 
with probability D; the offspring is placed at one of the 20 neighbouring sites, 
chosen randomly. For the results reported, B= 125, «= 0.001, Q= 30%, P=8 
and D= 0.01. Parameter variation affected details, but not the overall behaviour. 
For example, robustness can be demonstrated by setting Q = 0%, D = 0 and P= 1. 
The effects of varying S and Gare described in the text. Results for the number of 
species and distributions are obtained by time-averaging over 10,000 generations, 
sampling every 25 generations after reaching a steady state. 

The results reported here are for asynchronous mating, although many varia- 
tions of the model were implemented, with no essential changes in the behaviour. 
Synchronous mating gave similar results, although speciation times were typically 
twice as long. We also considered models that indirectly impose the genetic 
mating distance through selection on genetic differences that affect offspring 
viability and genetic differences regulating behavioural traits that promote repro- 
ductive isolation, models with multiple crossovers, models with adaptive changes 
in spatial and genetic mating distances when few potential mates were available, 
and models with populations of two sexes. We also obtained speciation for sig- 
nificantly lower mutation rates. For 44 = 0.00001, speciation occurred for S = 3, 
G=7 and N= 2,500 in a 128 X 128 lattice after 7,000 generations. Robustness 
suggests that improving the realism of the model by including other assumptions 
(details of mating and recombination, diploidy and dioecism) will not affect the 
basic results. 
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Evolution of a malaria resistance gene in wild 


primates 


Jenny Tung'”, Alexander Primus’”, Andrew J. Bouley’, Tonya F. Severson’, Susan C. Alberts 


& Gregory A. Wray’”?* 


The ecology, behaviour and genetics of our closest living relatives, the 
nonhuman primates, should help us to understand the evolution of 
our own lineage. Although a large amount of data has been amassed 
on primate ecology and behaviour, much less is known about the 
functional and evolutionary genetic aspects of primate biology, espe- 
cially in wild primates. As a result, even in well-studied populations 
in which nongenetic factors that influence adaptively important 
characteristics have been identified, we have almost no understand- 
ing of the underlying genetic basis for such traits. Here, we report on 
the functional consequences of genetic variation at the malaria- 
related FY (DARC) gene in a well-studied population of yellow 
baboons (Papio cynocephalus) living in Amboseli National Park in 
Kenya. FY codes for a chemokine receptor normally expressed on the 
erythrocyte surface that is the known entry point for the malarial 
parasite Plasmodium vivax'*. We identified variation in the cis- 
regulatory region of the baboon FY gene that was associated with 
phenotypic variation in susceptibility to Hepatocystis, a malaria-like 
pathogen that is common in baboons*°. Genetic variation in this 
region also influenced gene expression in vivo in wild individuals, a 
result we confirmed using in vitro reporter gene assays. The patterns 
of genetic variation in and around this locus were also suggestive of 
non-neutral evolution, raising the possibility that the evolution of the 
FY cis-regulatory region in baboons has exhibited both mechanistic 
and selective parallels with the homologous region in humans**. 
Together, our results represent the first reported association and 
functional characterization linking genetic variation and a complex 
trait in a natural population of nonhuman primates. 

In humans, a transition from the wild-type T variant to a C variant 
at a single polymorphic site in the FY cis-regulatory region causally 
abolishes all expression of this gene in erythrocytic precursors. As a 
result, C homozygotes at this site are strongly protected from infec- 
tion by P. vivax’, and a lower level of protection is also conferred on 
C/T heterozygotes'®"'. The C variant has apparently arisen indepen- 
dently at least twice in geographically distinct human populations (in 
Africa and in Papua New Guinea”''), and has been driven to high 
frequencies on at least two haplotypic backgrounds within Africa®. 
Additionally, the pattern of variation in the cis-regulatory region as a 
whole strongly indicates a historical pattern of natural selection in 
different populations around the world, probably as the product of 
directional selection in some populations (for example, local positive 
selection), and a complex mix of selection and demographic history 
in others®*. The unusual evolutionary history of this locus led us to 
investigate the pattern of genetic variation in its baboon homologue, 
and to explore the possibility that it might also explain phenotypic 
variation in parasite infection in a wild primate population, the well- 
studied baboon population of the Amboseli basin in East Africa’*""*. 


1,2,3,4% 


Baboons are not generally infected by Plasmodium in the wild, but 
are vulnerable to infection by several closely related haematoprotozo- 
ans*° including Hepatocystis kochi, a blood parasite nested within the 
paraphyletic Plasmodium genus’. Hepatocystis parasites do not pro- 
duce the cyclical fever spikes typical of malaria in humans, but do 
produce anaemia and visible merocyst formation, followed by scarring 
on the liver’. 

We tested for the presence of Hepatocystis parasites by screening 
DNA samples extracted from baboon blood for 190 individuals in the 
Amboseli baboon population. We found a high incidence of 
Hepatocystis infection in the Amboseli population (61.9%), although 
rates of infection varied substantially between different social groups 
and over time, possibly because of differences in home range and 
hence exposure to the vector, a biting midge’. 

In 174 of 190 baboons that we screened for Hepatocystis, we also 
sequenced the region of baboon DNA homologous to the annotated 
human FY cis-regulatory region. We identified six single nucleotide 
polymorphisms (SNPs) in the baboon FY cis-regulatory region (Fig. 1 
and Supplementary Fig. 1; the malaria-associated SNP documented in 
humans was invariant in the baboons). Hepatocystis infection was sig- 
nificantly associated with an A/G variable site in the FY cis-regulatory 
region, in a model that took social group (a significant source of vari- 
ance in infection) and genetic background into account. The risk of 
infection decreased as the number of G alleles an individual carried 
increased (P< 0.012, n = 174; Fig. 2). 

We also investigated whether FY cis-regulatory variation in 
baboons causally influences gene expression, as the C/T variant does 
in humans. We collected 101 samples of RNA-preserved blood from 
adults in six baboon social groups between 2004 and 2008, and used 


(CA), 


3.7 kb 


Figure 1| Schematic of the baboon FY gene (not to scale). Boxed regions 
are regions of the gene present in mature messenger RNA (open boxes are 
untranslated regions; black boxes are protein coding sequence). Grey lines 
indicate untranscribed regions. The bent arrow indicates the start of 
transcription. The downward arrows show two baboon SNPs used as 
markers for the pyrosequencing assays. The vertical black bars mark cis- 
regulatory SNPs in baboons, with the Hepatocystis-associated SNP labelled 
A/G and the allelic-imbalance-associated SNP labelled C/T. The dashed 
vertical bar marks the location of the functional SNP known in humans. The 
dashed horizontal lines provide relative distances and/or sizes of features. 
The location of the (CA),, microsatellite used in the F,, analyses is also shown 
upstream. 
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Figure 2 | Genotype at the FY cis-regulatory A/G SNP is associated with 
Hepatocystis infection. The proportion of uninfected individuals is shown 
in grey, and the proportion of infected individuals is shown in white. 

a, Results for the entire sample set (n = 174; P< 0.012); b, results only for 
members of the six groups with high prevalence (>75%) of Hepatocystis 
infection (n = 111; P< 0.004). Numbers below each genotype show the 
number of individuals for the given genotype. 


these samples to measure allele-specific expression at the FY locus 
using pyrosequencing. Specifically, we investigated whether the level 
of FY expression driven by one cis-regulatory FY allele differed from 
the level of FY expression driven by the other cis-regulatory FY allele, 
within the same individual. Because allele-specific expression com- 
pares the relative amounts of gene expression within individuals, it 
controls for effects on gene expression operating in trans, such as 
those produced by genetic background or by environmental main 
effects'”'*. If the two alleles within an individual drive expression 
differently (allelic imbalance), that individual is likely to harbour a 
functional cis-regulatory variant that influences gene expression. 

We measured allele-specific expression in 38 individuals (all the 
individuals among the 101 RNA-sampled baboons that were hetero- 
zygous at a transcribed pyrosequencing assay SNP: see Fig. 1). 
Average log, fold-change differences in expression between alleles 
within heterozygous individuals ranged from —0.002 (no difference 
between alleles) to 2.13 (substantial difference between alleles). This 
suggested that one or more common functional cis-regulatory var- 
iants influenced expression of the baboon FY gene in the Amboseli 
population. We predicted that if a cis-regulatory variant contributes 
to variation in gene expression, then individuals heterozygous at the 
cis-regulatory site would show significantly higher levels of allelic 
imbalance than individuals homozygous for the same variant. 

Genotypes at four of the six SNPs in the baboon FY cis-regulatory 
region were sufficiently variable to test for an association with allelic 
imbalance. Genotype at the SNP closest to the start of transcription, a 
C/T transition, was significantly associated with allelic imbalance in 
the predicted direction: heterozygotes exhibited higher levels of allelic 
imbalance than homozygotes (P< 0.002, n = 38; Fig. 3). However, 
this site explained only 22.0% of the overall variance in the allelic 
imbalance samples, after taking into account year of sampling. 
These results suggested that this C/T SNP functionally influences gene 
expression of the FY gene within the baboon population, but that 
additional cis-regulatory variants probably also play a part. 

We next investigated the A/G SNP in the FY cis-regulatory region 
that we had associated with Hepatocystis infection risk. Of the 38 
individuals for whom we measured allele-specific expression, 37 were 
heterozygous for this A/G SNP. Hence, we could not compare allelic 
imbalance levels between heterozygotes and homozygotes at this site. 
We therefore tested this variant for possible functional effects using 
an in vitro approach in cell culture. We also used this framework to 
test further whether the C/T variant that was associated with allelic 
imbalance causally influenced FY expression. For each of these two 
regulatory SNPs (A/G and C/T), we built two plasmid constructs 
consisting of the FY cis-regulatory region linked to the firefly lucifer- 
ase gene, such that the two constructs differed only at the variable site. 
We then tested the ability of these constructs to drive gene expression 
in a human erythroleukaemic (HEL) cell line. For the C/T SNP, the T 
allele construct drove significantly higher levels of expression than 
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Figure 3 | Allelic imbalance is associated with FY cis-regulatory genotype. 
Height of the bars shows the mean for each genotypic class (heterozygotes: 
0.329 + 0.116 s.e.m.; homozygotes: —0.285 + 0.086 s.e.m.). The y axis gives 
the residuals of log-transformed allelic imbalance on year of sampling 

(n = 38). Heterozygotes at the C/T SNP exhibit high values of allelic 
imbalance relative to homozygotes at the C/T SNP after controlling for the 
effect of year of sampling. High values indicate that the two alleles of the FY 
gene are transcribed at different levels within individuals, and suggest a 
functional cis-regulatory role for this SNP. 


the alternative C allele construct (P< 0.0001; Fig. 4a). Similarly, for 
the A/G SNP, the G allele construct drove significantly higher levels of 
expression than the alternative A allele construct (P< 0.0001; 
Fig. 4b). These results suggest that both SNPs have the capacity to 
drive differential expression of the FY gene. In the case of the C/T 
SNP, for which both in vivo and in vitro analyses were possible, the T 
allele was associated with higher levels of expression in both experi- 
ments. Unlike the human case, in which one regulatory SNP results in 
null expression, all the baboon haplotypes we tested drove robust 
expression of the gene. 

Together, these data indicate that the FY cis-regulatory region is 
associated with parasite infection in a wild population of baboons, 
and that functional sequence variants within this region causally 
influence the level of expression of the FY gene. As in humans, vari- 
ation in gene expression at the FY locus may therefore be important 
in parasite susceptibility, either through altering the direct access of 
Hepatocystis to baboon erythrocytes or, as has recently been demon- 
strated in humans, by altering a more general property of the 
immune system, such as relative white blood cell counts’”. 
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Figure 4 | FY cis-regulatory variation drives differential expression in vitro. 
a, The C/T SNP identified through the allelic imbalance measurements. 

b, The A/G SNP that associates with Hepatocystis infection drives differential 
gene expression compared to the alternative allele of the same SNP in cell 
culture. Values on the y axis give the relative ratio of firefly luciferase 
luminescence to a control renilla luciferase reporter. Human constructs 
from a normal-expressing individual (haplotype from the CEPH/Utah 
HapMap panel) and a null-expressing individual (Luhya) and an empty 
vector are shown for comparison. Error bars show s.e.m. for each construct. 
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These results suggest that the genetic basis of phenotypic variation 
in different primate species can exhibit a remarkable degree of 
parallelism. In this case, not only are these similarities present on 
the molecular level or on the level of trait association, as shown by 
previous work”®”!, but they also extend to the mechanism that links 
molecular and phenotypic variation (which is probably gene expres- 
sion). Given these parallelisms, baboons, like humans, may also 
exhibit evidence of non-neutral evolution at the FY cis-regulatory 
region. We detected an increased level of population differentiation 
among East African baboon populations around FY, by comparing a 
FY-linked microsatellite with 35 neutral microsatellites (Fy = 0.31, 
P<0.029; range of F,, for the neutral markers was 0.008-0.346; F,, 
is a metric describing genetic divergence between populations based 
on allele frequency differences at variable sites; Supplementary Fig. 3). 
We also detected a higher value for the Tajima’s D statistic (D = 1.26) 
in this region relative to nine of nine other resequenced putative cis- 
regulatory regions in the Amboseli population and 11 of 12 rese- 
quenced transcribed regions (range of D for all other loci was — 1.60 
to 2.12). The only locus with a higher value of D, a transcribed portion 
of the gene MSR1, exhibited an even more extreme value than that 
identified for the MHC DQA1 promoter in baboons”, which is known 
to evolve under strong trans-specific balancing selection”®. 

Interestingly, a sliding window analysis showed that the peak 
values of D corresponded well with the Hepatocystis and allelic- 
imbalance-associated SNPs (Supplementary Fig. 3). Given that rates 
of Hepatocystis infection appear to vary across different populations 
(30% in the Masai Mara Reserve, Kenya, n= 10; 90% in Mikumi 
National Park, Tanzania, n = 20; see Supplementary Tables 1 and 2), 
these results suggest that the baboon FY cis-regulatory region may be 
subject to a complex selective history similar to the case described in 
humans’”* in which differing levels of pathogen pressure across 
populations are associated with high levels of population differenti- 
ation around the FY gene, and varying signatures of selection within 
populations. 

In spite of the parallels that we have documented for baboon and 
human FY, the functional variants we have identified in baboons are 
not homologous to the known functional variant in humans, which 
reveals that phenotypic variation in different primate species may 
show similar, but not precisely convergent, patterns of evolution. 
Indeed, while in humans the FY—malaria relationship is Mendelian, 
both FY expression and infection by Hepatocystis in baboons are 
clearly complex traits: even individuals homozygous for the 
Hepatocystis ‘resistance’ variant (the G allele at the A/G SNP) suffer 
from parasitism, albeit at a lower rate (52.2% of GG homozygotes 
were infected, versus 67.9% of AA homozygotes, across all study 
groups: see Fig. 2 and Supplementary Fig. 2). Additionally, the in 
vitro cell culture experiments suggest that the G allele of this variant 
actually drives higher expression of FY than the alternative A allele, 
even though the G allele is associated with a lower risk of Hepatocystis 
infection. The relationship between FY gene expression and 
Hepatocystis in baboons is therefore clearly different from that in 
humans, perhaps owing to balancing the cost of infection by other 
blood parasites, some of which are not known to co-occur with, and 
might be excluded by, Hepatocystis™. Alternatively, while the in vitro 
data on the A/G variant strongly suggest that this site has the capacity 
to influence FY gene expression, the direction and magnitude of its 
effects may differ in its natural cellular context. 

This possibility is supported by the differences in magnitude of the 
effect of the C/T cis-regulatory variant in the in vitro transfection 
assays and the in vivo allelic imbalance measurements. In vivo gene 
expression measurements are complicated by variation in genetic 
background and in the environment, both of which can modify func- 
tional cis-regulatory effects***°. Indeed, our results show that even 
baboons that are homozygotes at the C/T site sometimes exhibit allelic 
imbalance in FY expression, suggesting that other, unidentified func- 
tional cis-regulatory variants are also segregating in the population. In 
contrast, in the in vitro comparisons, only a single cis-regulatory site 
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differed between the experimental constructs, thus controlling for 
both environment and genetic backgrounds. Using both approaches 
in tandem can be synergistic: while in vitro experiments can help pin 
down specific functional sites, in vivo results demonstrate that these 
effects are relevant to the biology of individuals in the wild. 

Thus, although identifying the genetic basis for phenotypic vari- 
ation in wild primates poses substantial challenges, we present this 
study as a model to motivate additional evolutionary genetic research 
on natural primate populations. This work is essential if we hope to 
integrate an evolutionary and functional genetic perspective into the 
rich tradition of organismal research on these species. Our results 
demonstrate that patterns of variation in nonhuman primates can 
provide unique insights into the influence of ecological and environ- 
mental factors on genetic and trait variation in humans. Integrative 
research on nonhuman primates should also help us develop a better 
understanding of the evolution of our own species. 


METHODS SUMMARY 


We screened for Hepatocystis in 190 baboons, using Hepatocystis mitochondrial 
DNA specific primers. We sequenced the region homologous to the human FY 
cis-regulatory region in 174 of these baboons. We identified predictors of 
Hepatocystis infection in these 174 individuals using a generalized linear mixed 
model with a binomial error structure, fitting genotype and an estimate of 
genetic background as fixed effects and study group as a random effect. 

Allelic imbalance was assessed via pyrosequencing using RNA collected from 
the study population, and was measured as the log, transformed ratio of the 
expression of one allele versus the other allele, within individuals. We modelled 
allelic imbalance using a general linear mixed model and evaluated the signifi- 
cance of the genotype effect on allelic imbalance using a permutation test. 

We co-transfected human erythroleukaemic cells with experimental con- 
structs or a control construct and a normalization construct. A total of 24 
measurements were made per construct in three independent experiments. 
We compared the measurements for each pair of constructs separately using 
general linear mixed models. 

For the F,, comparison, we genotyped up to 36 polymorphic microsatellite loci 
(35 neutral markers plus one located near the FY cis-regulatory region”’) in ten 
baboons from the Masai Mara Reserve, Kenya, 20 baboons from Mikumi National 
Park, Tanzania, and 12 baboons from the Amboseli basin, Kenya (Supplementary 
Information). We compared the F,, value for the FY-linked microsatellite locus 
with those for the 35 neutral loci. The probability of observing a more extreme 
level of genetic differentiation at the FY-linked locus was assessed by fitting a 
gamma distribution to the F,, values for the neutral loci, and calculating the 
average value of P based on different parameters of the gamma (weighted by 
the probability of each parameter set, given the data). Tajima’s D values were 
calculated for samples from Amboseli using DnaSP version 4.9 (ref. 27). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

DNA and RNA sampling. Blood samples for DNA extraction and Hepatocystis 
screening were collected from 190 Amboseli baboons between 1989 and 2008 
(Supplementary Fig. 2)**. DNA was extracted using standard methods. DNA for 
some individuals was whole genome amplified (Qiagen Repli-G Kit). Blood 
samples for RNA extraction were collected in PaxGene RNA tubes from 101 
adult Amboseli baboons darted between 2004 and 2008. RNA was extracted 
using the PaxGene RNA Blood kit (Qiagen), and reverse transcribed into com- 
plementary DNA (ABI High Capacity cDNA Archive Kit). 

Sequencing. We amplified and sequenced the region homologous to the anno- 
tated FY cis-regulatory region in humans in 174 individuals and sequenced or 
genotyped the two pyrosequencing assay SNPs in 150 individuals. To assess 
congruence between the in vitro and in vivo gene expression results for the 
C/T SNP, we inferred haplotype phasing using PHASE 2.1.1 (ref. 29). 
Hepatocystis screen and association with FY. We screened for Hepatocystis in 
all 190 baboons using Hepatocystis mtDNA specific primers, which produced a 
band of approximately 251 base pairs on an agarose gel in the presence of 
Hepatocystis (Supplementary Methods). 

We then fitted the following generalized linear mixed model for 150 individuals, 
using a binomial error structure: 


5 
P(yi, = 1| Gj) = logit Ge + YS Duy + S + b+ :) 
u=1 

where the logit link function has been used to change the expression in 
parentheses into the probability of observing the parasite, given the explanatory 
variables. y is Hepatocystis infection status (y= 1 corresponds to infected; y = 0 
corresponds to uninfected), individuals are indexed by i, and study group is 
indexed by j. f is a fixed effect of genotype, Giz v, is the fixed effect of the 
projection Dj,, on the uth principal component of population structure; S; is a 
random effect of study group; b is the intercept; and ¢ represents model error 
(Supplementary Information). Principal-components analysis of population 
structure was conducted on data from 47 unlinked loci (33 SNPs and 14 micro- 
satellites, Supplementary Table S4). We also included study group as a random 
effect because group is a common source of structure in baboons, and infection 
rates clearly differed between groups. We evaluated the significance of /, the SNP 
effect, as evidence for association between infection and cis-regulatory variation. 
Pyrosequencing. We designed pyrosequencing assays based on two variable 
SNPs in the transcribed region of the FY gene. 38 individuals for which cDNA 
samples were available were also heterozygous at one or both of these sites. For 
each of these individuals, we performed six to eight pyrosequencing reactions 
across two plates (mean number of measurements per individual = 7.05, 
range = 3-8, excluding failed reactions). The resulting values were expressed 
as the log, transformed ratio of the expression of transcripts carrying one versus 
the other allele at one of the assay SNPs (values based on the alternative assay 
SNP were converted on the basis of linkage between the two sites). 

We identified an effect of one of the upstream cis-regulatory sites by modelling 
variation in allelic imbalance using the following general linear mixed model: 


Vig = BGG t+ Yor bt+e 
where y is allelic imbalance, indexed by individual i and year of sampling j; f is a 


fixed effect of homozygous or heterozygous genotype, Gj Y;is a random effect of 
year of sampling (2006, 2007 or 2008; one individual was sampled in 2005 and 
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grouped with the 2006 samples); b is the intercept; and ¢ is the model error. We 
assessed the significance of / using a permutation test: all measurements for an 
individual were grouped as a block and permuted 1,000 times over individual 
identity. We assigned a P value to the original SNP parameter estimate by rank- 
ing it among the corresponding estimates for the permuted data sets. The esti- 
mate of variance explained by the C/T site is based on modelling the residuals of 
allelic imbalance on year of sampling, using the C/T site alone. 

Three pairs of individuals in the allelic imbalance analysis were related at 
r = 0.5; removing any set of three individuals so that no individuals were closely 
related did not qualitatively change our results (see also Supplementary 
Methods). 
Transfection assays. Human erythroleukaemic cells (HEL 92.1.7) were main- 
tained using the ATCC protocol. The wells of 24-well cell culture plates were 
seeded with 2 X 10° cells in 500 ul media, transfected, incubated for 48h, and 
lysed. Cells were co-transfected with experimental constructs or empty firefly 
luciferase vector as control (pGL4.10; 180 ng per well) and the CMV Renilla 
normalization construct (pGL4.75, 20 ng per well; Promega) using Fugene 6 
(Roche). Expression levels were measured with a dual-luciferase reporter assay 
(DLR1000 assay kit, Promega) and reported as relative ratios of luminescence 
(firefly:Renilla). Eight replicate wells were transfected for each experimental and 
control vector within an assay, with the assay repeated three times (1 = 24 total 
measurements per construct). 

We compared the measurements for each pair of constructs (A versus G for 
the Hepatocystis-associated SNP and C versus T for the SNP associated with 
allelic imbalance) separately using the following model: 


Vij = BC; + Ey + b+e 


where y is the relative ratio of luminescence for replicate i of construct j in 
experiment k; f is a fixed effect of construct, Cj Ex is a random effect of experi- 
ment; b is the intercept; and ¢ is model error. 

Signature of selection. For comparison of the Fy values, we genotyped up to 36 
polymorphic microsatellite loci in ten baboons from the Masai Mara Reserve, 
Kenya; 20 baboons from Mikumi National Park, Tanzania; and 12 baboons from 
Amboseli National Park (Supplementary Information). One locus is a polymorphic 
microsatellite located ~3.7 kb upstream of the sequenced FY cis-regulatory region”’; 
the other 35 loci reside in putatively neutral sites dispersed around the baboon 
genome’~’*. We calculated F,, values for each of these loci independently using 
Arlequin 3.1 (ref. 30), and compared the F,, value for the FY-linked microsatellite 
locus with the Fy values for the 35 neutral loci (Supplementary Figs 3, 4 and 
Supplementary Methods). For comparison of Tajima’s D, we resequenced 21 other 
regions in and around genes in the same or a subset of the individuals resequenced 
at the FY cis-regulatory region (Supplementary Fig. 3 and Supplementary Table 5). 
We calculated Tajima’s D for all loci using the program DnaSP version 4.9 (ref. 27), 
assuming no recombination. 
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Rapamycin fed late in life extends lifespan in 
genetically heterogeneous mice 


David E. Harrison'*, Randy Strong’*, Zelton Dave Sharp’, James F. Nelson’, Clinton M. Astle’, Kevin Flurkey’, 
Nancy L. Nadon”, J. Erby Wilkinson®, Krystyna Frenkel’, Christy S. Carter®+, Marco Pahor*+, Martin A. Javors’, 


Elizabeth Fernandez” & Richard A. Miller!°* 


Inhibition of the TOR signalling pathway by genetic or pharmaco- 
logical intervention extends lifespan in invertebrates, including 
yeast, nematodes and fruitflies'°; however, whether inhibition 
of mTOR signalling can extend lifespan in a mammalian species 
was unknown. Here we report that rapamycin, an inhibitor of the 
mTOR pathway, extends median and maximal lifespan of both 
male and female mice when fed beginning at 600 days of age. On 
the basis of age at 90% mortality, rapamycin led to an increase of 
14% for females and 9% for males. The effect was seen at three 
independent test sites in genetically heterogeneous mice, chosen to 
avoid genotype-specific effects on disease susceptibility. Disease 
patterns of rapamycin-treated mice did not differ from those of 
control mice. In a separate study, rapamycin fed to mice beginning 
at 270 days of age also increased survival in both males and 
females, based on an interim analysis conducted near the median 
survival point. Rapamycin may extend lifespan by postponing 
death from cancer, by retarding mechanisms of ageing, or both. 
To our knowledge, these are the first results to demonstrate a role 
for mTOR signalling in the regulation of mammalian lifespan, as 
well as pharmacological extension of lifespan in both genders. 
These findings have implications for further development of inter- 
ventions targeting mTOR for the treatment and prevention of age- 
related diseases. 

Because incidences of most diseases rise rapidly with age’, interven- 
tions that delay ageing would greatly benefit health’*. So far, dietary 
additives that delay ageing and increase lifespan in rodent models have 
shown only weak effects”. Before clinical studies are considered, anti- 
ageing interventions must be repeatable and effective in many mouse 
genotypes, and not merely postpone strain-specific diseases'’*"*. 

The National Institute on Aging Interventions Testing Program 
(ITP) evaluates agents that may delay ageing and increase lifespan in 
genetically heterogeneous mice’*'’”. Agents are chosen as summarized 
at http://www.nia.nih.gov/ResearchInformation/ScientificResources/ 
InterventionsTestingProgram.htm. Studies are simultaneously repli- 
cated at three test sites: The Jackson Laboratory (TJL), the University of 
Michigan (UM), and the University of Texas Health Science 
Center (UT). BALB/cBy]  C57BL/6J F1 (CB6F1) females and C3H/ 
HeJ X DBA/2J Fl (C3D2F1) males are supplied to each site by The 
Jackson Laboratory, and mated to produce genetically heterogeneous 
populations in which each animal is genetically unique, but a full 


sibling of all other mice in the population'’*. Sufficient mice are used 
to provide 80% power to detect a 10% increase (or decrease) in mean 
lifespan with respect to unmanipulated controls of the same sex, even if 
data from one of the three test sites were to be unavailable. Here we 
report that dietary encapsulated rapamycin increases mouse survival, 
including survival to the last decile, a measure of maximal lifespan. 

Rapamycin reduces function of the rapamycin target kinase TOR 
and has anti-neoplastic activities; genetic inhibition of TOR extends 
lifespan in short-lived model organisms. In male and female mice at 
each of three collaborating research sites, median and maximum life- 
span were extended by feeding encapsulated rapamycin starting at 600 
days of age (Fig. 1). We analysed the data set as of 1 February 2009, 
with 2% (38 of 1,901) of mice still alive. For data pooled across sites, a 
log-rank test rejected the null hypothesis that treatment and control 
groups did not differ (P< 0.0001); mice fed rapamycin were longer 
lived than controls (P < 0.0001) in both males and females. Expressed 
as mean lifespan, the effect sizes were 9% for males and 13% for 
females in the pooled data set. Expressed as life expectancy at 600 days 
(the age of first exposure to rapamycin), the effect sizes were 28% for 
males and 38% for females. Mice treated with other agents (enalapril 
and CAPE (caffeic acid phenethyl ester)) evaluated in parallel did not 
differ from controls at the doses used (Supplementary Fig. 1). 
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Figure 1| Survival plots for male and female mice, comparing control mice 
to those fed rapamycin in the diet starting at 600 days of age, pooling 
across the three test sites. P values were calculated by the log-rank test. 
Four per cent of the control mice and three per cent of rapamycin-assigned 
mice were removed from the experiment for technical reasons. Only five 
animals (three controls, two rapamycin) were removed after the start of 
rapamycin treatment at 600 days. Thus, there were no significant differences 
between groups in censoring. 
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Rapamycin-fed and control mice were then compared separately 
for each combination of site and gender. Rapamycin had a consistent 
benefit, compared with controls, with P values ranging from 0.03 to 
0.0001 (Fig. 2). 

Female mice at all three sites had improved survival after rapamy- 
cin feeding (Fig. 2). Mean lifespan increases for females were 15%, 
16% and 7% (TJL, UM and UT, respectively), and life expectancy at 
600 days increased by 45%, 48% and 22% for females at the three 
sites. Median lifespan estimates of control females were consistent 
across sites (881-895 days), and were similar to values noted in 
Cohort 2004, which ranged from 858 to 909 days'*. Thus, the 
improvement in survival seen in the rapamycin-fed females is not 
an artefact of low survival for the control females. 

Male mice at all three sites also had improved survival after rapa- 
mycin feeding (Fig. 2). Mean lifespan increases for males were 5%, 
8% and 15% (TJL, UM and UT, respectively), and male life expec- 
tancy at 600 days increased by 16%, 23% and 52%. Interpretation is 
complicated by differences among sites in survival of control males, 
and because mice assigned to the rapamycin-fed group at UT and 
perhaps at UM had lower mortality before 600 days than controls. 
Control mice at UT and UM differed from those fed rapamycin not 
only in exposure to rapamycin from 600 days of age but also in 
specific formulation of the mouse chows (all based on the NIH-31 
standard) used between weaning and 600 days. We thus cannot rule 
out the possibility that improved survival among males in the rapa- 
mycin group, at UT and at UM, might reflect differences in nutri- 
tional or health status between control and rapamycin groups before 
600 days, rather than solely the effects of rapamycin. Notably, the 
significant benefits of rapamycin on male (and female) survival at 
TJL could not have been affected by diet before drug administration, 
because at TJL both control and rapamycin-fed mice received the 
same chow (Purina 5LG6) throughout this period. 

Maximum lifespan was increased by rapamycin feeding. Table 1 
shows the ages at the 90th percentile for control and rapamycin- 
treated mice, along with the 95% upper confidence bound for the 
controls. For each site and sex, the 90th percentile age for rapamycin- 
treated mice is higher than the upper limit for the corresponding 
control group, showing that rapamycin increases the age for 90th 
percentile survival. 

To determine whether increases in maximal lifespan due to rapamy- 
cin feeding are statistically significant, we compared the proportion of 
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living mice in each group after 90% had died in the joint life table’ 
(details in Supplementary Table 1). Summing across the three sites, 
4.8% of the female control mice were alive at these ages, compared with 
21.5% of the rapamycin-treated females (P< 0.0001). For males, the 
corresponding values were 5.9% of controls and 20.2% of rapamycin- 
treated mice (P< 0.0001). The site-specific calculations documented a 
significant effect on females at both TJL (P= 0.0006) and UM 
(P= 0.0001); for males, we noted a significant effect at both TJL 
(P= 0.008) and UT (P=0.0001), with a marginal effect at UM 
(P = 0.07). Rapamycin feeding initiated at 600 days of age thus leads 
to a significant increase in maximal lifespan. 

To test if the spectrum of lesions was altered by dietary rapamycin, 
complete necropsies were conducted on 31 control and 40 rapamycin- 
fed mice that were either found dead or killed when moribund (details 
in Supplementary Table 2). Although rapamycin postpones death, it 
did not change the distribution of presumptive causes of death. 

A separate group of mice was used to evaluate the effects of encap- 
sulated rapamycin initiated at 270 days of age (Fig. 3a). At the time of 
analysis, 51% of the females and 68% of the males had died, and a 
stratified log-rank test showed significantly lower mortality risk in 
the rapamycin-treated mice compared to controls, pooling across the 
three test sites (P= 0.0002 for males and P< 0.0001 for females). 
When each site was evaluated separately, the beneficial effect of rapa- 
mycin for females was significant at each site (P< 0.005); for males, 
the effect was significant (P< 0.025) at UM and UT, but not at TJL. 
Rapamycin seems to reduce mid-life mortality risk when started at 
270 days of age, but additional data are needed to provide an accurate 
estimate of effect size, and to evaluate effects on maximal longevity. 

To document biochemical effects of rapamycin at the dose used for 
the lifespan studies, we evaluated the phosphorylation status of 
ribosomal protein subunit S6 (rpS6)—a target substrate of S6 kinase 
1 in the mTOR signalling pathway”°—in visceral white adipose tissue 
(a sensitive indicator of mTOR inhibition by rapamycin treatment 
in vivo). Figure 3b shows that rapamycin feeding reduced the levels of 
phosphorylated rpS6 4—5-fold when fed from 270 to about 800 days 
of age. Blood levels of rapamycin in the treated mice were equivalent 
in males and females, between 60 and 70ngml"'. 

Initial evidence that reduced TOR function can extend longevity 
came primarily from studies in yeast’* and invertebrates*®. 
Beneficial effects of diet restriction?! and dwarf mutations, both of 
which extend lifespan in rodents, may, to some degree, result from 
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Figure 2 | Survival of control and rapamycin-treated mice for males and females for each of the three test sites separately. P values represent results of log- 
rank calculations. Vertical lines at age 600 days indicate the age at which the mice were first exposed to rapamycin. 
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Table 1| The effect of rapamycin on maximum lifespan 
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Comparison Sites Age in days at 90th percentile for Age in days at 90th percentile for Percentage increase 
controls (upper confidence limit)* rapamycin-treated mice 
Females 
Rapamycin versus controls All sites ,094 (1,136) 1,245 14 
Rapamycin versus controls TIL ,100 (1,165) 1,282 17 
Rapamycin versus controls UM ,094 (1,149) 1,250 14 
Rapamycin versus controls UT ,089 (1,159) 1,179 8 
Males 
Rapamycin versus controls All sites ,078 (1,111) 1,179 9 
Rapamycin versus controls TIL ,035 (1,091) 1,142 10 
Rapamycin versus controls UM 141 (1,177) 1,188 4 
Rapamycin versus controls UT ,020 (1,101) 1,179 16 
* The upper limit of the 95% confidence interval for control mice is indicated in parentheses. For example, in the top row, for females pooled across sites, the 95% confidence interval for controls goes 
up to 1,136 days, and the estimate for 90th percentile survival for the rapamycin-treated mice is 1,245 days. This gives good evidence that the 90th percentile survival for rapamycin-treated mice 


(1,245) is substantially above that for controls (1,094). 


repression of the mTOR complex 1 (mTORC1) pathway”. It is not 
yet known to what extent inhibition of mTOR will recapitulate other 
aspects of the phenotypes associated with diet restriction or dwarf 
mutations. Our demonstration that rapamycin feeding increases life- 
span even when started late in life, as well as the absence of changes in 
body weight (data not shown), distinguishes our results from studies 
using diet restriction: in all cases diet restriction reduces body weight, 
and in most reports”, although not all”, diet restriction produces 
little, if any, benefit if started after about 550 days of age. 
Rapamycin may extend lifespan in old, genetically heterogeneous 
mice through a combination of anti-neoplastic effects**”° and effects 
on cellular stress resistance and response to nutrient dynamics*’*’. The 
increase in both median and maximum lifespan seen in rapamycin- 
fed mice is consistent with the hypothesis that inhibiting the mTORC1 
pathway retards mammalian ageing, but is not compelling proof 
that ageing rates are altered; this would require testing whether the 


intervention decelerates age-dependent changes in multiple organs, 
cell types and intracellular and extracellular processes'*. Comparing 
the effects of rapamycin treatment and other models of decelerated 
ageing will help narrow the list of possible mechanisms for longevity 
extension. 

At the cellular level, mTORC1 helps to coordinate growth and 
survival responses induced by alterations in nutrient availability, 
energy status, growth factor stimuli and exposure to potentially lethal 
cell stresses” ”’; this strategic position at the nexus of nutrient/stress 
sensing pathways may contribute to the importance of TOR function 
in regulating lifespan in invertebrates and in mammals as well. It is 
especially noteworthy that rapamycin feeding can extend mouse life- 
span even when started late in life; in terms of the percentage of the 
maximal lifespan, a 600-day-old mouse is roughly the equivalent of a 
60-year-old person™. An effective anti-ageing intervention that could 
be initiated later than the midpoint of the lifespan could prove to be 
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Figure 3 | Characterization of mice receiving rapamycin from 270 days of 
age. a, Survival plots for male and female mice, comparing control mice to 
rapamycin-treated mice of a separate (Cohort 2006) population, in which 
mice were treated with rapamycin from 270 days of age. Because at the time of 
the interim analysis all live mice were between 800 and 995 days of age, we 
have only limited information about the shape of the survival curve at ages 
above 900 days, and the apparent change in slope at the oldest ages 

(>990 days) reflects this experimental uncertainty. P values were calculated 
by the log-rank test. b, Effects of dietary rapamycin on an mTORC1 effector in 
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the visceral fat pads from 750-day-old to 880-day-old male and female mice. 
Ribosomal subunit protein S6 (rpS6) and its phosphorylation status (P-rpS6, 
double arrow) were immunoassayed in tissue lysates prepared from mice 
consuming microencapsulated rapamycin-containing or control diets. 
Antibodies used are shown to the left. The ratio of intensity values for P-rpS6/ 
rpS6 is shown in the graphs for female and male mice. Pan-actin was also 
immunoassayed in the blots to provide an indication of protein loading for 
each lane. c, Whole blood rapamycin content in 750-day-old to 880-day-old 
male and female mice. In b and, error bars show standard errors of the mean. 
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especially relevant to clinical situations, in which the efficacy of anti- 
ageing interventions would be particularly difficult to test in younger 
volunteers. Our data justify special attention to the role of the TOR 
pathway in control of ageing in mammals and in the pathogenesis of 
late-life illnesses. 


METHODS SUMMARY 

Mice. Specific pathogen-free (SPF) mice were produced at each of the three test 
sites by mating CB6F1 females with C3D2F1 males to produce a genetically 
heterogeneous population’’. Weanlings at the three sites were fed similar diets 
(but not identical; see Supplementary Information for details) until they were 
started on food containing rapamycin, in Purina 5LG6, at 600 days of age. A 
second independent study was begun 1 year later, with rapamycin initiated at 
270 days rather than at 600 days of age. The principal endpoint was age at death 
(for mice found dead at daily inspections) or age at euthanasia (for mice deemed 
unlikely to survive for more than an additional 48 h). 

Diet preparation. Rapamycin (from LC Labs) was microencapsulated by 
Southwest Research Institute (San Antonio, Texas), using a spinning disk atomi- 
zation coating process with the enteric coating material Eudragit $100 (Rohm 
Pharma). This coating increased the fraction of rapamycin that survived the food 
preparation process by three- to fourfold, and protected the agent from digestion 
in the stomach’®. Encapsulated rapamycin was then incorporated into 5LG6 
mouse chow and distributed to all three test sites. Rapamycin in blood and diet 
was measured by high-performance liquid chromatography (HPLC) with ultra- 
violet detection as detailed in Supplementary Information. 

Rapamycin effectiveness. To test effects of rapamycin on mTORCI targets, we 
measured phosphorylation of ribosomal protein S6 (at Ser 240 and Ser 244), a 
substrate of S6 kinase 1, in visceral adipose tissue. Adipose tissue was dissected 
from mice that had been fed a rapamycin diet for 420 days; lysates were loaded on 
a 4-12% gradient PAGE and electrophoresed overnight at 5V. After transfer to 
membranes, blocking and incubation with appropriate antibodies, total 
amounts of rpS6 and amounts of phosphorylated rpS6 were quantified by 
chemiluminescence. Techniques are detailed in Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mouse production, maintenance and estimation of lifespan. Mice were pro- 
duced at each of the three test sites by mating CB6F1 females with C3D2F1 males 
to produce a genetically heterogeneous population. Details of the methods used 
for health monitoring were provided previously’; in brief, each of the three 
colonies was evaluated four times each year for infectious agents, including 
pinworm. All such tests were negative throughout the entire study period. 
Each test site enrolled approximately equal numbers of 19- to 21-day-old weanl- 
ings each month over a 6-month period, housing three males or four females per 
cage. Each site used diets that the manufacturer stated were based on the NIH-31 
standard for breeding cages, and the period between weaning and the initiation 
of experimental diets is given below. For breeding cages, UM used Purina 5008, 
UT used Teklad 7912 and TJL used Purina 5K52. For weanlings before 120 days 
of age, UM used Purina 5008, UT used Teklad 7912 and TJL used Purina 5LG6. 
Starting when 120 days old, mice in the control, enalapril and CAPE groups 
received Purina 5LG6 at all three sites, without additives (control group) or with 
the test agent. Mice in the rapamycin group remained on the weanling diet until 
they began to receive rapamycin, in Purina 5LG6, at 600 days of age. Separate 
cohorts of control and rapamycin-treated mice were established in the same way 
1 year later, again at each test site, but with rapamycin initiated at 270 days rather 
than at 600 days of age. Additional husbandry details, including accounts of tests 
for T-cell subset distribution and activity administered to a subset of each group, 
are provided elsewhere'®'”. 

Removal of mice from the longevity population. The Cohort 2005 study popu- 
lation, distributed almost equally among the three test sites, consisted initially of 
1,960 mice, of which 674 were assigned to the control group and 317-328 to each 
of the four treatment groups. Of these, 51 mice were removed from the study 
because of fighting (31 mice), accidental death (such as chip implantation or 
cage flooding; 13 mice), or because of technical error (error in gender assignment 
or diet selection; 7 mice). For survival analyses, mice were treated as alive at the 
date of their removal from the protocol, and lost to follow-up thereafter. These 
censored mice were not included in calculations of median longevity. 
Estimation of age at death (lifespan). Mice were examined at least daily for signs 
of ill health, and were killed for humane reasons if they were so severely mori- 
bund that they were considered, by an experienced technician, unlikely to survive 
for more than an additional 48 h. A mouse was considered severely moribund if it 
showed more than one of the following clinical signs: (1) inability to eat or to 
drink; (2) severe lethargy, as indicated by a lack of response such as a reluctance 
to move when gently prodded with a forceps; (3) severe balance or gait distur- 
bance; (4) rapid weight loss over a period of 1 week or more; or (5) a severely 
ulcerated or bleeding tumour. The age at which a moribund mouse was killed 
was taken as the best available estimate of its natural lifespan. Mice found dead 
were also noted at each daily inspection. Bodies were fixed for later necropsy 
analysis. 

Control and experimental diets. TestDiet (Richmond) prepared batches of 
Purina 5LG6 food containing each of the test substances, as well as control diet 
batches, at intervals of approximately 120 days, and shipped each batch of food at 
the same time to each of the three test sites. Enalapril was purchased from Sigma 
(catalogue E6888-5G) and used at 120 mg per kg food; on the assumption that 
the average mouse weighs 30g and consumes 5g of food per day, this dose 
supplies 20 mg enalapril per kg body weight per day. CAPE, caffeic acid phe- 
nethyl ester, was purchased from Cayman (catalogue 70750), and used at either 
of two doses: the high dose was 300 mg per kg food (50 mg per kg body weight per 
day), and the low dose was 30 mg per kg food (5 mg per kg body weight per day). 
Enalapril was tested because in aged humans and in rodent models of hyperten- 
sion, obesity, diabetes and congestive heart failure, it has been reported to 
improve many of these conditions. CAPE was tested because this agent has been 
reported to possess antioxidant, anti-inflammatory and immunomodulatory 
capabilities, as well as specific toxicity to transformed and tumour cells. 
Lifespans of mice given enalapril or CAPE are compared with controls and those 
given rapamycin in Supplementary Fig. 1. Rapamycin was purchased from LC 
Labs. The rapamycin was microencapsulated by Southwest Research Institute 
(San Antonio, Texas), using a spinning disk atomization coating process with the 
enteric coating material Eudragit $100 (R6hm Pharma). This thermoplastic 
coating material increased the fraction of rapamycin that survived the food 
preparation process by three- to fourfold’*. Because the coating material is water 
soluble only in non-acidic conditions, the encapsulated rapamycin is released in 
the small intestine rather than in the stomach. A pilot study showed that encap- 
sulated rapamycin led to blood concentrations approximately tenfold higher 
than achieved by equivalent doses of non-encapsulated rapamycin’®. The encap- 
sulated rapamycin was administered at 14 mg per kg food (2.24 mg of rapamycin 
per kg body weight per day). 
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Measurement of rapamycin. Rapamycin was obtained from LC Laboratories. 
32-desmethoxyrapamycin (32-RPM) was obtained from Sigma Chemical 
Company. HPLC grade methanol and acetonitrile were purchased from 
Fisher. All other reagents were purchased from Sigma Chemical Company. 
Milli-Q water was used for preparation of all solutions. The HPLC system con- 
sisted of a Waters 510 HPLC pump, Waters 717 autosampler, Waters 2487 UV 
detector, and Waters Empower chromatographic software (Waters). The HPLC 
analytical column was a Grace Alltima C18 (4.6 X 150mm, 5 um) purchased 
from Alltech. The mobile phase was 64% (v/v) acetonitrile and 36% water. The 
flow rate of the mobile phase was 1.5 ml min” | and the wavelength of absorbance 
was 278 nm. The temperature of the HPLC analytical column was maintained at 
70°C during the chromatographic runs using an Eppendorf CH-30 column 
heater. 

Rapamycin and 32-RPM powder were dissolved in methanol at a concentra- 
tion of | mg ml | and stored in aliquots at —80 °C. A working stock solution was 
prepared each day from the methanol stock solutions at a concentration of 
lpg ml ' and used to spike the calibrators. Calibrator samples were prepared 
daily by spiking either whole blood or mouse food with stock solutions to achieve 
final concentrations of 0, 4, 8, 12, 24, 100 and 200 ng ml. 

Rapamycin was quantified in mouse blood using HPLC with ultraviolet detec- 
tion. Briefly, 0.5 ml of calibrators and unknown samples were mixed with 75 ull of 
1.0pgml~' 32-desmethoxy rapamycin (internal standard), 1.0ml ZnSO4 
(50g1~') and 1.0ml of acetone. The samples were vortexed vigorously for 
20s, then centrifuged at 2,600g at 23°C temperature for 5 min (subsequent 
centrifugations were performed under the same conditions). Supernatants were 
transferred to clean test tubes, then 200 pl of 100 mM NaOH was added, followed 
by vortexing. Then, 2 ml of 1-chlorobutane was added and the samples were 
capped, vortexed (1 min), and centrifuged. The supernatants were transferred to 
10 ml glass tubes and dried to residue under a stream of nitrogen at ambient 
temperature. The dried extracts were dissolved in 750 ul of mobile phase and 
then 2 ml of hexane was added to each tube. The tubes were capped, vortexed for 
30s, and centrifuged for 2 min. The hexane layers were removed and discarded. 
The remaining extracts were dried under nitrogen and reconstituted in 250 ul of 
mobile phase, and then 200 tll of the final extracts were injected into the HPLC. 
The ratio of the peak area of rapamycin to that of the internal standard (response 
ratio) for each unknown sample was compared against a linear regression of 
calibrator response ratios to quantify rapamycin. The concentration of rapamy- 
cin was expressed as ng per ml whole blood. 

Rapamycin content of mouse chow was verified using HPLC with ultraviolet 

detection. Briefly, 100 mg of chow for spiked calibrators and unknown samples 
was crushed with a mortar and pestle, then vortexed vigorously with 20 pl of 
100 kg ml”! 32-RPM (internal standard) and 0.5 ml methanol. The samples were 
then mechanically shaken for 10 min. Next, 0.5 ml of Millipore water was added 
and the samples were vortexed vigorously for 20 s. The samples were centrifuged 
for 10 min and then 40 kl were injected into the HPLC. The ratio of the peak area 
of rapamycin to that of the internal standard (response ratio) was compared 
against a linear regression of calibrator response ratios at rapamycin concentra- 
tions of 0, 2, 4, 8, 10 and 20ng mg | of food to quantify rapamycin. The con- 
centration of rapamycin in food was expressed as ng per mg food (parts per 
million). 
Rapamycin effectiveness. To assay for the status of an mTORC1 downstream 
effector, we measured phosphorylation of ribosomal protein S6 (Ser 240 and Ser 
244), a substrate of S6 kinase 1, in visceral adipose tissue lysates in mice fed an 
encapsulated rapamycin diet for 420 days or a control diet with empty micro- 
capsules. Tissues were dissected and snap frozen in liquid nitrogen for storage at 
—80 °C, ground into powder under liquid nitrogen and dissolved in 10 volumes 
of buffer (50mM Tris-HCl (pH 7.5), 120 mM NaCl, 1% NP-40, 1mM EDTA, 
50mM Naf, 40 mM 2-glycerophosphate, 0.1 mM Na orthovanadate (pH 10), 
1mM benzamidine, and 1X Complete protease inhibitor cocktail (Roche)). 
After sonication and microcentrifugation, lysates were quantified”. Forty micro- 
grams of soluble protein from each extract was loaded on a 4—12% gradient PAGE 
and electrophoresed overnight at 5V. Gels were then transferred to nitrocellulose 
membranes (dry procedure), blocked and incubated with the primary antibodies 
(S6 ribosomal protein (5G10) rabbit monoclonal antibody, catalogue number 
2217; phospho-S6 ribosomal protein (Ser235/236) antibody, catalogue number 
2215; pan-actin antibody, catalogue number 4968; Cell Signaling Technologies), 
followed by secondary antibody (anti-rabbit IgG (H+L) peroxidase conjugated 
antibody, catalogue number 31460; Pierce) for detection by chemiluminescence. 
Signal intensities for each immunoblot were captured using a Kodak Image 
Station, and these were analysed using Kodak 1D image analysis software. 


30. Bradford, M. M. A rapid and sensitive method for the quantitation of microgram 
quantities of protein utilizing the principle of protein-dye binding. Anal. Biochem. 
72, 248-254 (1976). 
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A conserved ubiquitination pathway determines 
longevity in response to diet restriction 


Andrea C. Carrano’, Zheng Liu”, Andrew Dillin’**> & Tony Hunter’ 


Dietary restriction extends longevity in diverse species, suggesting 
that there is a conserved mechanism for nutrient regulation and 
prosurvival responses’. Here we show a role for the HECT (homo- 
logous to E6AP carboxy terminus) E3 ubiquitin ligase WWP-1 as 
a positive regulator of lifespan in Caenorhabditis elegans in 
response to dietary restriction. We find that overexpression of 
wwp-1 in worms extends lifespan by up to 20% under conditions 
of ad libitum feeding. This extension is dependent on the FOXA 
transcription factor pha-4, and independent of the FOXO tran- 
scription factor daf-16. Reduction of wwp-1 completely suppresses 
the extended longevity of diet-restricted animals. However, the 
loss of wwp-1 does not affect the long lifespan of animals with 
compromised mitochondrial function or reduced insulin/IGF-1 
signalling. Overexpression of a mutant form of WWP-1 lacking 
catalytic activity suppresses the increased lifespan of diet- 
restricted animals, indicating that WWP-1 ubiquitin ligase acti- 
vity is essential for longevity. Furthermore, we find that the E2 
ubiquitin conjugating enzyme, UBC-18, is essential and specific 
for diet-restriction-induced longevity. UBC-18 interacts with 
WWP-1 and is required for the ubiquitin ligase activity of 
WWP-1 and the extended longevity of worms overexpressing 
wwp-1. Taken together, our results indicate that WWP-1 and 
UBC-18 function to ubiquitinate substrates that regulate diet- 
restriction-induced longevity. 

HECT E3 ligases promote the ubiquitination of proteins that are 
essential in a variety of cellular events. The mammalian WWPI1, 
WWP2 and ITCH family of WW domain HECT ligases (WWP 
ligases) were initially identified in a search for new proteins contain- 
ing WW domains, which are modular protein-interaction domains 
recognizing short proline motifs in their partners’. WWP ligases have 
an amino-terminal C2 domain, which is a phospholipid membrane 
interaction motif, followed by four WW domains. To identify cellular 
pathways in which WWP E3 ligases are required, we have taken 
advantage of C. elegans as a model organism, which contains a single 
HECT WWP £3 ligase orthologue, wwp-1 (Y65B4BR.4). Disruption 
of wwp-1 using RNA interference (RNAi) yields a lethal phenotype 
late in embryogenesis characterized by abnormal embryogenesis 
despite normal cell proliferation’. The wwp-1(0k1102) mutant allele 
has a partially penetrant embryonic lethal phenotype’. Independent 
of the early developmental function of wwp-1, we found that the loss 
of wwp-1 decreased stress resistance during adulthood (Sup- 
plementary Fig. la, b, e), leading us to investigate a possible role in 
longevity. Loss of wwp-1 function by RNAi or mutation reduced 
lifespan at 25°C (Supplementary Fig. 2a, b), but not at 20°C 
(Supplementary Fig. 3a, b), consistent with a role for wwp-1 in stress 
resistance. To investigate whether increased expression of wwp-1 
extended longevity in N2 (wild-type) worms, we created stable trans- 
genic lines that express an N-terminal green fluorescent protein 


(GEP)—WWP-1 fusion protein, under the control of the endogenous 
wwp-1 promoter in which expression of wwp-1 messenger RNA is 
increased by approximately 50% (Supplementary Fig. 4). The over- 
expressing wwp-1 transgenic lines (GFP::WWP-1) lived up to 20% 
longer than controls expressing GFP under the same promoter 
(Fig. la), indicating that wwp-1 is a positive regulator of lifespan. 

When diet is restricted, lifespan is extended in diverse species, 
suggesting that there is a conserved mechanism for nutrient regu- 
lation of ageing. Dietary restriction in worms can be reproduced 
genetically using eat-2(ad1116) mutant worms”®. Reduced levels of 
wwp-1 completely suppressed the extended longevity of eat-2 mutant 
animals (Fig. 1b). The suppression of diet-restriction-extended life- 
span by wwp-I depletion is unlikely to be due to increased food 
intake, because no difference in pharyngeal pumping rates with loss 
or knockdown of wwp-1 in N2 or eat-2(ad1116) worms was observed 
(Supplementary Table 4). 

We tested whether the loss of wwp-1 suppressed the extended 
longevity of diet-restricted animals by reduced food intake imposed 
by bacterial dilution in liquid culture. N2 animals exhibited a bell- 
shaped curve for lifespan in response to varying bacterial concentra- 
tions (Fig. 1c)’°. The lifespan of N2 animals grown under diet- 
restricted conditions was more than double that of animals fed ad 
libitum (Fig. 1c, d). In contrast, the lifespan of wwp-1(0k1102) 
mutant worms across the entire food concentration range did not 
change noticeably (Fig. 1c, d), indicating that WWP-1 has an essen- 
tial role in regulating the response to nutrient intake and longevity. 
Similar results were seen using another method of dietary restriction, 
solid-plate dietary restriction’’ (Supplementary Fig. 5). To determine 
whether dietary restriction could affect wwp-1 expression, we used 
quantitative PCR (qPCR) to quantify wwp-1 mRNA and found no 
difference in wwp-1 expression in animals grown under diet-restricted 
and ad libitum conditions (Supplementary Fig. 6). Furthermore, 
expression of a GFP:: WWP-1 transgene partially rescued the suppres- 
sion of dietary restriction longevity in wwp-1(ok1102) mutants 
(Supplementary Figs 4 and 7). Because the loss of wwp-1 prevented 
the extension of lifespan of animals grown using three different diet- 
restriction methods, we conclude that wwp-I1 is essential for the 
increased longevity response to dietary restriction. 

The FOXA transcription factor PHA-4 is required to specifically 
mediate diet-restriction-induced longevity in C. elegans’. RNAi 
reduction of pha-4 suppressed the increased longevity of worms 
overexpressing wwp-1 (Fig. le), but not when these worms were 
fed bacteria expressing double-stranded (ds)RNA against daf-16— 
the forkhead transcription factor required for the increased longevity 
due to reduced insulin/IGF1 signalling’ (Fig. 1f). Mutations in the 
iron sulphur component of complex III, isp-1, increase longevity by 
reducing mitochondrial function'*"’. wwp-1 RNAi did not suppress 
the extended lifespan of isp-1(qm150) mutant animals (Fig. 1g), and 


'Molecular and Cell Biology Laboratory, The Salk Institute for Biological Studies, La Jolla, California 92037, USA. 7The Howard Hughes Medical Institute, The Salk Institute for Biological 
Studies, 10010 North Torrey Pines Road, La Jolla, California 92037, USA. *The Glenn Center for Aging Research, 10010 North Torrey Pines Road, La Jolla, California 92037, USA. 


396 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE| Vol 460|16 July 2009 


N2 vector — 


eat-2(ad1116) — 
vector 

eat-2(ad1116) — 

wwp-1 RNAi 


204 —GFP 
—GFP::WWP-1 (line 1) 
—GFP::WWP-1 (line 2) 


0 0 
0 5 10 15 20 25 30 35 ) 10 20 30 40 
Time (days) Time (days) 
© 70 d 100 
@ 60 wwp-1(0k1102) AL— 
S 80 wwp-1(0k1102) DR — 
BS 50 are = N2 AL— 
g 40 — wwp-1(0k1102) & 60 N2DR— 
oO 
8 2 
2 30 = 40 
3 . 20 
= 10 
0 0 
0.01 0.1 1 10 100 0 10 20 30 40 50 60 70 80 
E. coli conc (108 cells per ml) Time (days) 
& 100 f 
80 
S 60 
oO 
2 
< 407 — Grp 


i 


10 15 20 25 30 35 
Time (days) 


ae 0 
0 5 10 15 20 25 30 35 0 5 


Time (days) 
h 100 


80 80 
60 60 
2 2 

< 40 z 40 


— Vector 
— wwp-7 RNAi 
— daf-16 RNAi 


—isp-1(qm150) 
wwp-1 RNAi 


0 10 20 30 40 50 0 10 20 30 40 50 
Time (days) Time (days) 

Figure 1| wwp-1 is required and specific for the extension of lifespan by 
dietary restriction. Lifespan values are given in Supplementary Tables 1 and 
2. Two-way analysis of variance (ANOVA) results are presented in 
Supplementary Table 7. Supplementary Fig. 15a, c shows data confirming 
specific knockdown of wwp-1 expression by RNAi. a, Two independent 
wwp-1 overexpressing strains (GFP::WWP-1) can extend longevity 
compared to control worms expressing GFP. b, Lifespan analysis of 
eat-2(ad1116) mutant animals fed bacteria expressing wwp-1 dsRNA or 
control vector. ¢, Lifespans of N2 and wwp-1(0k1102) mutant worms grown 
in S basal buffer with different E. coli concentrations. d, Lifespan analysis of 
N2 and wwp-1(ok1102) mutant worms grown in diet-restricted (DR) or ad 
libitum (AL) E. coli concentrations. e, f, Lifespan analysis of wwp-1 
overexpressing worms (GFP::WWP-1) or a control line fed bacteria 
expressing pha-4 dsRNA (e), or daf-16 dsRNA (f). g, h, Lifespan analysis of 
isp-1(qm150) (g) and daf-2(e1368) (h) fed bacteria expressing wwp-1 
dsRNA or control vector. 


had only minor suppressive effects on lifespan extension of another 
mitochondrial mutant, clk-1(qm30), and in cyc-1 RNAi-treated 
worms (Supplementary Fig. 8). Partial loss of function mutations 
in the insulin/IGF-1 receptor homologue, DAF-2, increase lifespan 
in a daf-16-dependent, pha-4-independent manner*’®. RNAi deple- 
tion of wwp-1 had no effect on the long lifespan of daf-2 mutant 
animals (Fig. 1h and Supplementary Fig. 9a). Our results indicate 
that loss of wwp-1 does not make animals sick, but rather specifically 
regulates the response to dietary restriction that results in extended 
longevity. 

Ubiquitination by HECT ligases requires the intermolecular transfer 
of ubiquitin from an associated E2 to the E3 ligase, before transfer to a 
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lysine in the target protein'’. These transfers depend on the formation 
of a thioester bond between ubiquitin and a conserved cysteine in the 
HECT domain. Mutation of this cysteine renders HECT ligases cata- 
lytically inactive, and the mutants act as dominant negatives in vivo'®. 
We established an in vitro ubiquitination assay for WWP-1 ligase 
activity using C. elegans embryo extract as a source of substrates. In 
the presence of extract, bacterially expressed glutathione S-transferase 
(GST)-WWP-1 had very robust ligase activity, which was abolished 
by mutation of the catalytic cysteine (C762A) of WWP-1 (Fig. 2a). We 
then compared the longevity of eat-2(ad1116) transgenic animals that 
overexpress a GFP—WWP-1(C762A) fusion protein driven by the 
wwp-1 promoter to a control line expressing GFP under the same 
promoter. Two independent eat-2(ad1116) transgenic lines expressing 
the dominant negative wwp-I had a significantly shorter lifespan, 
comparable to wild-type animals (Fig. 2b). Therefore, the ubiquitin 
ligase activity of WWP-1 is essential for diet-restriction-induced 
longevity. 

UBC-18 is a putative E2 that regulates pharyngeal morphogenesis 
during early embryonic development'”*'. UBC-18 is homologous to 
human UBCH7 (also known as UBE2L3), and similar to 
Saccharomyces cerevisiae Ubc5 and Ubc4 (ref. 21). Recently, a two- 
hybrid screen using UBC-18 as a bait identified WWP-1 and the RING 
finger E3 ligases ARI-1 and F56D2.2 as UBC-18 interactors’’. Unlike 
ubc-18 and ari-1 dsRNA, inactivation of wwp-1 by dsRNA treatment 
failed to produce a pharynx-unattached phenotype in pha-1(e2123) 
animals, suggesting that WWP-1 may function with UBC-18 to ubi- 
quitinate targets not involved in pharyngeal development’. 
Consistent with this, RNAi of either ari-1 or F56D2.2 did not affect 
the long lifespan of the eat-2 mutant animals (Supplementary Fig. 10). 

We found that UBC- 18 is indeed a functional E2. UBC-18 formed 
a thiol ester bond with ubiquitin (Fig. 3a), and recombinant WWP-1 
ubiquitin ligase activity required UBC-18 and El in vitro (Fig. 3b). 
Extracts prepared from worms mutant for ubc-18, ubc-18(ku354)”', 
greatly reduced WWP-1-dependent ubiquitin ligase activity, which 
was restored by the addition of recombinant UBC-18 (Fig. 3b). 
Finally, we confirmed that UBC-18 and WWP-1 associate in vitro 
(Fig. 3c). 

We tested whether ubc-18 was essential for diet-restriction- 
induced longevity. Like wwp-1, ubc-18 has a role in stress resistance 
in C. elegans (Supplementary Fig. 1c—-e). However, we found that 
overexpression of ubc-18 was unable to extend lifespan in C. elegans 
(Supplementary Fig. 11). Possibly, UBC-18 is not limiting for WWP-1 
function in lifespan. Loss of ubc-18 function reduced lifespan at 25 °C 
(Supplementary Fig. 2c), but only slightly at 20°C (Supplementary 
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Figure 2 | WWP-1 ubiquitin ligase activity is essential for diet-restriction- 
induced longevity. a, Mutation of the conserved catalytic cysteine of WWP-1 
abolishes ubiquitin (Ub) ligase activity. In vitro ubiquitination assay of 
recombinant wild-type (WT) WWP-1 or mutant WWP-1(C762A) using 

C. elegans embryo extract. b, eat-2(ad1116) mutant worms expressing a 
dominant negative wwp-1(C762A) have significantly shorter lifespans than 
control worms expressing GFP. Lifespan values are given in Supplementary 
Table 1. 
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Figure 3 | WWP-1 exhibits ubiquitin ligase activity in a UBC-18 dependent 
manner in vitro. a, UBC-18 forms thiol-sensitive adducts with ubiquitin. In 
vitro ubiquitin (Ub) conjugation reaction in which samples were subjected 
to SDS-PAGE with or without f-mercaptoethanol (f-merc). b, UBC-18 is 
essential for ubiquitin ligase activity in vitro. Left panel, in vitro 
ubiquitination assay of wild-type WWP-1 using N2 or ubc-18 mutant (ubc- 
18(ku354)) embryo extract. Recombinant UBC-18 was added in the last lane. 
Right panel, in vitro ubiquitination assay using purified components. ¢, GST 
pull-down assay in which GST-WWP-1 (or GST alone) bound to 
glutathione-agarose beads was incubated with cell lysates expressing ubc-18. 


Fig. 3c, d). However, RNAi depletion of ubc- 18 completely suppressed 
the increased longevity of eat-2 mutants (Fig. 4a). This decreased 
lifespan is unlikely to be due to impaired pharynx function, because 
RNAi was initiated at the L1 stage when the pharynx is completely 
developed, and ubc-18 RNAi initiated at the first day of adulthood also 
suppressed the increased longevity of eat-2(ad1116) animals. Like 
wwp-1 depletion, we did not see a difference in pharyngeal pumping 
rates with loss of ubc-18 (Supplementary Table 4), and RNAi depletion 
of ubc-18 had no effect on the long lifespan of isp-1(qm150) (Fig. 4b) 
or daf-2 mutant animals (Fig. 4c and Supplementary Fig. 9b). 
Moreover, epistasis analysis of wwp-1 and ubc-18 indicated that the 
combined knockdown of both genes by RNAi in eat-2(ad1116) 
animals did not shorten lifespan any further than RNAi of either single 
gene (Fig. 4d). Finally, knockdown of ubc-18 suppressed the extended 
lifespan of wwp-1 overexpressing animals (Fig. 4e). 

In summary, the UBC-18-WWP-1 complex functions to specify 
the longevity response of diet-restricted animals. Because E2s often 
function with several E3s, it is surprising to find that ubc-18 was not 
only essential, but also specific for the response to dietary restriction. 
M7.1 (also known as UBC-2 and LET-70) is most homologous to 
UBCH5 (also known as UBE2D1), a mammalian E2 that associates 
with HECT ubiquitin ligases. Unlike ubc-18, loss of ubc-2 did not 
specifically suppress the extended longevity of eat-2 mutants and 
resulted in general sickness of animals (Supplementary Fig. 12). 
The other E3s that interact with UBC-18 may be dedicated instead 
to the developmental function of UBC-18, as is the case for ARI-1 
(ref. 19). It is interesting that expression of WWP-1 and UBC-18 is 
observed in several neurons localized in the head and tail of adult 
animals (Supplementary Fig. 13), because many recent studies in 
C. elegans and Drosophila suggest that signals derived from the 
nervous system can control longevity” ~*°. Although it is intriguing 
to speculate that a few key neuronal cells in the nervous system are the 
site of action of WWP-1—-UBC- 18 to regulate longevity, expression is 
not confined to a few neurons, as is the case for the dietary restriction 
regulator, SKN-1B’. Furthermore, expression of WWP-1—-UBC-18 is 
found in intestinal cells, another site where longevity cues are 
expressed in the worm”. 

Because several transcription factors have been identified as targets 
for the mammalian orthologues of wwp-1 (ref. 17), we investigated 
whether WWP-1 may target one of the two transcription factors 
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Figure 4 | WWP-1 and UBC-18 function together to regulate diet- 
restriction-induced longevity. Lifespan values are given in Supplementary 
Table 1. Two-way ANOVA results are presented in Supplementary Table 7. 
Knockdown of ubc-18 expression by RNAi is shown in Supplementary Fig. 
15d. a, Lifespan analysis of eat-2(ad1116) mutant worms fed bacteria 
expressing ubc-18 dsRNA or control vector initiated after hatching of eggs 
(L1) or day 1 adults (D1). b, ¢, Lifespan analysis of isp-1(qm150) (b) and daf- 
2(e1368) (c) fed bacteria expressing ubc-18 dsRNA or control vector. 

d, Lifespan analysis of eat-2(ad1116) mutant animals fed bacteria expressing 
wwp-I dsRNA and vector (wwp-1 RNAi), ubc-18 dsRNA and vector (ubc-18 
RNAi), wwp-1 and ubc-18 dsRNA (wwp-1 + ubc-18 RNAi), or control 
vector. e, Lifespan analysis of wwp-1 overexpressing worms (GFP::WWP-1) 
or control worms (GFP) fed bacteria expressing ubc-18 dsRNA or control 
vector. 


essential for dietary restriction longevity in the worm: PHA-4 and 
SKN-1B”*. The genetic epistasis analysis of ubc-18/wwp-1 suggested 
that PHA-4 may be a target for ubiquitination. Using our in vitro 
ubiquitination assay for WWP-1, we were unable to detect ubiquiti- 
nated conjugates for either PHA-4 or SKN-1B in a purified system 
(Supplementary Fig. 14). Recently it has been shown that pha-4 and 
the CeTor (also known as let-363) pathway antagonize one another to 
regulate longevity in adults’’. Our results might suggest that wwp-1 
may feed into the CeTor pathway as well. The identification of the 
targets of UBC-18-WWP-1 is needed to allow precise placement of 
this complex in the diet-restriction pathway. 

Our study uncovers for the first time, to our knowledge, a role of 
the ubiquitin pathway in longevity in response to dietary restriction. 
Given the strong conservation of wwp-1 with mouse and human 
WWPYI, an attractive hypothesis is that the mammalian orthologue 
will also be critical for diet-restriction-induced longevity. A detailed 
understanding of the pathways that mediate the benefits of dietary 
restriction may lead to new therapies for age-related diseases. 


METHODS SUMMARY 

Caenorhabditis elegans methods. The wwp-1 mutant strain was generated by 
backcrossing RB1178 (wwp-1(ok1102)) to N2 three times (Supplementary Fig. 
15b). Nematodes were handled using standard methods”. 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE| Vol 460|16 July 2009 


Lifespan analysis. Lifespan analyses were performed as described”’. Bacterial 
dilution diet-restricted lifespans were performed as described® with the following 
modifications: synchronized populations of eggs were hatched and grown at 
20°C on NG agar plates containing OP50 E. coli until the L4 larval stage when 
they were transferred to plates of OP50 containing 100 pg ml ' FUDR. At day 1 
of adulthood, worms were transferred into liquid culture. All lifespans were 
performed at 20 °C unless noted. 

Protein extraction for in vitro ubiquitination assay. Caenorhabditis elegans 
embryos were isolated using an alkaline hypochlorite solution from gravid N2 
worms grown at 20 °C**. The embryos were resuspended in lysis buffer (50 mM 
Tris-HCl, pH 7.5, 0.75 mM EDTA, 1.5 mM dithiothreitol (DTT), 2.5 mM PMSF, 
1pgml”' aprotinin, 1 :gml~! leupeptin, 1 pgml~' pepstatin A) and homo- 
genized with 30 strokes in a Dounce homogenizer. The extract was centrifuged 
15,000g at 4°C and stored at —70 °C. 

In vitro ubiquitination assay. The ubiquitination assay was carried out by 
incubating lpg Flag-tagged ubiquitin, 0.5~1g GST-WWP-1 (wild-type or 
C762A mutant), 0.1 ug UBC-18 and 15-20 1g embryo extract in 30 ul reaction 
buffer (50 mM Tris-HCl, pH 7.5, 5 mM MgCh, 1 mM DTT and5 mM ATP) for 1h 
at 30°C. The reaction was stopped with sample buffer and run on denaturing 
protein gels. Ubiquitinated substrates were identified by anti-Flag (M2, Sigma) 
immunoblotting. For in vitro ubiquitination assays using purified components, 
similar conditions were administered except 0.5 ug UBC-18 and 1 jg El were used. 
To measure UBC-18 ubiquitin conjugation, a similar reaction was performed and 
the reaction was stopped with sample buffer lacking B-mercaptoethanol. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Statistical analysis. JMP IN 5.1 software was used to determine means and 
percentiles. P values were calculated using the log-rank (Mantel-Cox) method. To 
assess differences in mortality between N2 and wwp-1(ok1102) strains in diet- 
restriction lifespans, a Cox proportional hazards regression model was generated 
in JMP IN 5.1 software. Two-way ANOVA was performed using Prism Version 5.0 
software. Mean lifespan*+s.em. and n of all experiments presented in 
Supplementary Tables 1 and 5 were considered with strain and RNAi treatment 
as the factors. 

Creation of WWP-1 and UBC-18 constructs for expression in C. elegans. To 
construct the plasmid expressing GFP::WWP-1 driven by the wwp-1 endogenous 
promoter, sequences 644 bases upstream of the wwp-1 coding region were amp- 
lified from genomic DNA by PCR and inserted upstream of GFP sequences in 
the worm expression vector pPD95.77. This construct (Pwwp-1::GFP), which 
expresses GFP under the control of the endogenous wwp-1 promoter, was used as 
a control. To generate an N-terminal GFP-fusion construct, point mutagenesis 
(Stratagene) was performed on the GFP construct to eliminate the stop codon in 
the GFP complementary DNA. Full-length wwp-1 cDNA was amplified from a 
first-strand cDNA from N2 worms by PCR, and inserted into the N-terminal 
GFP-fusion construct containing the wwp-1 endogenous promoter downstream 
and in frame with the GFP sequence. This construct (Pwwp-1::GFP::WWP-1) 
expresses GFP—WWP- fusion protein under the control of the wwp-1 promoter. 
To generate wwp-1(C762A) dominant negative mutants, point mutagenesis was 
performed on the Pwwp-1::GFP::;WWP-1 construct to generate Pwwp- 
I::GFP::WWP-1(C762A). Primers for the wwp-1 promoter: forward, 5'-GCT 
CTAGACTTGTTTCCTGATGACCTTG-3’; reverse, 5'’-CGGGATCCTCGATC 
ATGAAACTGGCTG-3’. Primers for wwp-1 cDNA: forward, 5'-ATGGCTCG 
TAATGAACCATCATCTCAGCAG-3’; reverse, 5’-CTACTCGTTTCCAAATCC 
TTCCGTCATCTC-3’. To construct the plasmid expressing GFP driven by the 
ubc-18 endogenous promoter (Pubc-18::GFP), sequences 911 bases upstream of 
the ubc-18 coding region were amplified from genomic DNA by PCR and 
inserted upstream of GFP sequences in the worm expression vector pPD95.67. 
To construct the plasmids expressing UBC-18 driven by the ubc-18 promoter, 
sequences from 911 bases upstream of the ubc-18 coding region to the stop 
codon of the ubc-18 coding region were amplified from genomic DNA by 
PCR and inserted upstream of the GFP sequence in the worm expression vector 
pPD95.77. Primers for ubc-18 promoter: forward, 5'-GATCTTTC 
TACTGTAGAC-3’; reverse, 5’‘-CGTTGATATAAACGCTCTAG-3’. Primers to 
generate Pubc-18::UBC-18::GFP: forward, 5'’-GATCTTTCTACTGTAGAC-3’; 
reverse, 5’-GCGTATTCAGGCCGCTTTTC-3’. Primers to generate Pubc- 
18:UBC-18: forward, 5’'-GATCTTTCTACTGTAGAC-3’; reverse, 5’-CTA 
TTCAGGCCGCCGCTTTTC-3’. 

Generation of transgenic lines. Extrachromosomal array carrying transgenic 
strains were generated using standard microinjection methods". For generation 
of N2 animals overexpressing wwp-1, Pwwp-1::GFP::WWP-1 plasmid DNA con- 
struct was mixed at 20 1g ml! with 80 ug ml | of pRF4(rol-6) construct*!. wwp- 
1(0k1102) mutant animals overexpressing wwp-1 were generated by crossing 
wwp-1(ok1102) mutants to Pwwp-1::GFP::WWP-1 males. For generation of 
eat-2(ad1116) worms overexpressing dominant negative WWP-1(C762A), 
Pwwp-1::GEP::WWP-1(C762A) plasmid DNA was mixed at 30pg ml! with 
80 ug ml” | of pRF4 construct. Worms used as controls against wwp-1 over- 
expressing strains contained 30,gml-' Pwwp-I::GEP expressing construct 
injected with 30 pg ml_' pRF4. For generation of N2 animals overexpressing 
ubc-18, Pubc-18::UBC-18::GFP or Pubc-18::UBC-18 plasmid DNA construct 
was mixed at 75 j1gml_' with 75 jg ml! pRF4 construct. Worms used as con- 
trols against ubc-18 overexpressing strains contained 75 pg ml”! Pubc-18::GFP 
expressing construct injected with 75 1gml~' pRF4 construct. Transgenic F, 
progeny were selected on the basis of roller phenotype. Individual transgenic 
F, animals were isolated to establish independent lines. 

GFP and Dil localization. Paralysed transgenic animals were assayed for GFP 
expression at X10 and X63 magnification using a Leica 6000B digital micro- 
scope. Images were acquired using Leica FW4000 and AF6000 software. For Dil 
assays, worms were incubated with 5 ug ml! Dil in M9 at room temperature for 
2h, then transferred to fresh plates for 1h to destain, and examined under the 
fluorescence microscope. Intestinal granule autofluorescence was viewed with 
the 4,6-diamidino-2-phenylindole (DAPI) filter. 

Stress assays. For heat-shock assays, eggs were grown on plates seeded with various 
RNAi treatments to D1 adulthood. Worms were transferred to plates without food 
and heat shocked at 35 °C and were checked every 2 h for viability. Paraquat (methyl 
viologen, Sigma) assays were performed as described’’. For all stress assays a 
representative experiment from at least two independent experiments is presented. 
Pumping-rate assays. Pumping rates were determined by counting pumps of 
the terminal pharyngeal bulb for 1-min intervals. For each condition, worms 
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were treated with bacteria expressing dsRNA since hatching. The pumping rates 
of ten D1 adult worms per condition were measured and averaged. 
Solid-plate dietary restriction. Lifespans were performed as described"® with the 
following modifications: adult worms are transferred every 2—3 days to freshly 
seeded plates with restrictive amounts of bacteria (5 X 10’ to 5 X 107° bacteria 
per ml) starting at day 5 of adulthood. 

RNA isolation, RT-PCR and quantitative RT-PCR. Total RNA was isolated 
from asynchronous populations of worms and extracted using Trizol reagent 
(GIBCO). cDNA was created using Superscript II RT (Invitrogen) and oligo dT 
primers. For quantitative PCR with reverse transcription (qRT-PCR) experi- 
ments, approximately 2,000 young adult worms with roller phenotype were hand 
picked from extrachromasomal array carrying transgenic lines grown at 20 °C. 
To determine how dietary restriction could affect wwp-1 expression, bacterial 
dilution diet-restriction cultures were performed as described above but at a 
larger scale to collect 4,000 worms per sample. Total RNA was extracted using 
Trizol reagent and cDNA was created using Quantitec Reverse Transcriptase kit 
(Qiagen). SYBRgreen real-time qPCR experiments were performed as described 
in the manual using ABI Prism 7900 HT (Applied Biosystems). Quantitative 
PCR primers: wwp-1 forward, 5'‘-AAGAAGCGCAGGAGTACGAG-3’; wwp-1 
reverse, 5'-ATTGATCGAAACGCATCTCC-3'; pmp-3 forward, 5'-GTTCCC 
GTGTTCATCACTCAT-3’;  pmp-3 reverse, 5’-ACACCGTCGAGAAGCTG 
TAGA-3’. 

Recombinant proteins and GST pull-down experiments. Full-length wwp-1 
cDNA (wild-type and C762A mutant) or ubc-18 cDNA was inserted into pGEX- 
KG bacterial expression vector. Full-length skn-1b and pha-4 cDNA were 
inserted into pHis8 bacterial expression vector. His purification was performed 
using Talon Metal Affinity Resin (Clontech). GST and His purifications were 
performed according to the manufacturer’s instructions. Flag-tagged ubiquitin 
has been previously described**”’. For active protein, the GST tag on UBC-18 was 
removed by thrombin cleavage, and the supernatant was incubated with benza- 
midine Sepharose-4B beads (GE Healthcare) for 15 min at room temperature to 
remove thrombin. Full-length ubc-18 cDNA was inserted into pcDNA6/Myc- 
His mammalian expression vector to generate a Myc—His fusion protein. GST- 
pull-down experiments were performed using 500 tg NIH3T3 lysates expressing 
ubc-18 by transient transfection (Effectene, Qiagen). Lysates were prepared as 
previously described**. Lysates were incubated with GST-WWP-1 or GST bound 
to glutathione agarose beads for 1.5 h. Beads were washed four times with lysis 
buffer and analysed my SDS-PAGE. UBC-18 was detected by immunoblotting 
with an anti-Myc (9E10) antibody. 

In vitro ubiquitination assay for PHA-4 and SKN-1B. The ubiquitin ligase assay 
for PHA-4 and SKN-1B was carried out by incubating 1 jg Flag-tagged ubiqui- 
tin, 0.5 ug GST-WWP-1 (wild-type or C762A mutant), 0.5 pg UBC-18, 0.1 pg 
His—E1 and ~1 tg His-PHA-4/SKN- 1B in 30 pl reaction buffer for 1 h at 30 °C. 
UBC-18 was preincubated with 1 mM PMSF to inhibit any residual thrombin 
activity. The reaction was stopped with sample buffer and run on a protein gel 
under denaturing conditions. PHA-4 and SKN-1B were detected by immuno- 
blotting with a monoclonal anti-poly Histidine antibody (Sigma). 

RNAi treatment. RNAi-treated strains were fed E. coli (HT115) containing an 
empty vector or E. coli expressing dsRNA against the gene of interest. The wwp-1 
and pha-4 RNAi clones were from J. Ahringer’s RNAi library*’. RNAi clones 
against ubc-18, ari-1 and F56D2.2 were from M. Vidal’s RNAi library**. RNAi 
clones against daf-16, daf-2 and cyc-1 have been previously described*”**. Double 
RNAi experiments were performed as described”’. 
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A reevaluation of X-irradiation-induced phocomelia 
and proximodistal limb patterning 


Jenna L. Galloway'*, lrene Delgado**, Maria A. Ros® & Clifford J. Tabin’ 


Phocomelia is a devastating, rare congenital limb malformation in 
which the long bones are shorter than normal, with the upper 
portion of the limb being most severely affected. In extreme cases, 
the hands or fingers are attached directly to the shoulder and the 
most proximal elements (those closest to the shoulder) are entirely 
missing. This disorder, previously known in both autosomal reces- 
sive and sporadic forms, showed a marked increase in incidence in 
the early 1960s due to the tragic toxicological effects of the drug 
thalidomide, which had been prescribed as a mild sedative’. This 
human birth defect is mimicked in developing chick limb buds 
exposed to X-irradiation*®>. Both X-irradiation’ and thalidomide- 
induced phocomelia*® have been interpreted as patterning defects 
in the context of the progress zone model, which states that a cell’s 
proximodistal identity is determined by the length of time spent in 
a distal limb region termed the ‘progress zone’’. Indeed, studies of 
X-irradiation-induced phocomelia have served as one of the two 
major experimental lines of evidence supporting the validity of the 
progress zone model. Here, using a combination of molecular ana- 
lysis and lineage tracing in chick, we show that X-irradiation- 
induced phocomelia is fundamentally not a patterning defect, but 
rather results from a time-dependent loss of skeletal progenitors. 
Because skeletal condensation proceeds from the shoulder to 
fingers (in a proximal to distal direction), the proximal elements 
are differentially affected in limb buds exposed to radiation at early 
stages. This conclusion changes the framework for considering the 
effect of thalidomide and other forms of phocomelia, suggesting 
the possibility that the aetiology lies not in a defect in the patterning 
process, but rather in progenitor cell survival and differentiation. 
Moreover, molecular evidence that proximodistal patterning is un- 
affected after X-irradiation does not support the predictions of the 
progress zone model. 

According to the progress zone model’, proximodistal structures 
develop under the influence of a continuous signal, now understood 
to be a fibroblast growth factor (FGF)*, produced by the overlying 
apical ectodermal ridge (AER). Initially, the entire limb mesenchyme 
has a proximal identity; however, the mesenchymal cells at the tip of 


Consequently, proximal cells spend an increased amount of time in 
the progress zone, and ultimately are specified to distal fates’. 

The X-irradiation experiments, however, were performed before 
the identification of molecular markers for each limb segment. The 
use of such markers could provide powerful additional data support- 
ing the progress-zone-based interpretation of irradiation-induced 
phocomelia or, alternatively, drawing into question the model’s 
validity. We therefore decided to reexamine this paradigm with mod- 
ern molecular tools. 

We irradiated embryos at embryonic day (E)3.5 (stage 19-21) and 
grafted right limb buds onto host wings as described”. Skeletal ele- 
ments were examined at E9.5 (6 days post-irradiation). As in pre- 
vious studies**, the phenotype was dose-dependent. Whereas 
unirradiated limbs developed normally on the host after grafting 
(see a and e in Fig. 1A), minor skeletal malformations such as elbow 
joint fusion were observed at low doses (see b and f in Fig. 1A). 
Intermediate doses resulted in the loss of the most proximal element, 
the humerus, and anterior digits (see c and g in Fig. 1A) and, at the 
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the limb bud under the AER—those in the ‘progress zone’—are 
exposed to FGF, allowing them to transition to more distal fates. 
As the progress zone cells divide and the limb grows, not all of these 
cells remain within range of the FGF signal. The fate of the cells 
exiting the progress zone becomes fixed, whereas those in this zone 
are once again respecified to more distal fates. The progress zone 
model elegantly explained the effects of X-irradiation on developing 
limb buds? by stating that the X-irradiation-induced cell death and 
resultantly smaller limb buds cause the proximal cells to remain 
within range of the AER-produced FGF signal for a longer time than 
normal, until the limb bud recovers and grows to a sufficient size. 


Figure 1| X-irradiation of chick limbs causes phocomelia. A, Limb grafts 
(a—d, arrows; e-h) are shown next to host limbs. Controls (a, e) develop a 
humerus (h), radius (r), ulna (u) and digits (d). Low doses cause elbow 
fusion (asterisk) and reduced anterior digit 2 (f). Intermediate-dosed-limbs 
have no humerus, but form a partial radius and ulna and normal posterior 
digits (c, g). At high doses, only digits form (d, h). B, Irradiated RFLs from 
shielded embryos (lower limb; a—c) lose proximal elements compared to 
LFLs (upper limb; a—c). Some irradiated specimens lack an anterior (a, b) or 
posterior digit (c, lower limb). Occasionally, LFLs had mild abnormalities 
(asterisks). a—c are ventral images at the same scale. s, scapula; c, clavicle; 
scale bars, 1 mm. 
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highest irradiation dose, only digits formed (see d and h in Fig. 1A). 
To eliminate the possibility that the irradiation phenotype could be 
explained by reabsorption of proximal tissue after grafting, we 
performed a parallel series of experiments irradiating limbs while 
shielding the rest of the embryo. Using this method, we generated 
embryos with shorter limbs lacking proximal elements. Thus, at a 
dose of 17.5—20 grays, 28% of irradiated right forelimbs (RFLs) were 
missing the humerus, and 36% lacked the humerus and radius/ulna 
(n = 36; see lower limbs in a—c in Fig. 1B). 

For the progress zone model to explain the phocomelia phenotype, 
the growth of the limb bud must be impaired by irradiation, while 
exposure to the permissive AER-produced FGF signal must not be 
disrupted. The irradiated limb buds are smaller than unirradiated 
and contralateral controls at 24, 48 and 72 hours (h) post-irradiation 
(Supplementary Fig. 2a, c, d). To determine whether FGF signalling 
continues unabated after X-irradiation, we performed whole-mount 
in situ hybridization (WISH), examining FGF8 expression in irra- 
diated limbs from grafted or shielded embryos. In all irradiated limbs, 
at all stages examined, we found normal FGF8 expression (100%, 
n= 18; Supplementary Fig. 2a, b). Maintenance of normal FGF sig- 
nalling levels from the AER requires the expression of a second sig- 
nalling molecule, sonic hedgehog (SHH), in the posterior limb bud 
mesenchyme”’’. Consistent with this, we observed no loss in SHH 
expression in irradiated limb buds (100%, n = 13; Supplementary 
Fig. 2a, b). Moreover, TFAP2A (also known as AP-2), a transcription 
factor dependent on FGF signalling’, was expressed in irradiated 
limbs at all stages analysed (100%, n = 30; Supplementary Fig. 2c, 
d), verifying that irradiation does not disrupt AER-produced FGF 
signalling to limb mesenchyme. 

The progress zone model predicts that continual FGF signalling 
within the geometry ofa small limb bud would lead to respecification 
of proximal tissue to more distal fates, thereby explaining the pho- 
comelia. To verify this hypothesis, we investigated the specification of 
the three major elements of the limb: the stylopod/humerus, zeugo- 
pod/radius-ulna and autopod/digits. The best markers for these limb 
segments are, respectively, MEIS1/2, HOXAII and HOXA13 
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(Fig. 2C). However, their expression patterns are dynamic at early 
stages and there is limited evidence that these transcription factors 
are required for proximodistal specification’*. Nonetheless, at later 
stages they uniquely delineate each segment of the limb along the 
proximodistal axis, providing a molecular indication of their spe- 
cification. The progress zone model would predict that proximal 
cells, being respecified to distal fates, would no longer express 
MEIS1/2 and instead express distal markers such as HOXAII1 or 
HOXA13. 

Surprisingly, WISH shows no difference in the relative domains of 
expression of these markers in irradiated and unirradiated limbs 
(Figs 2A, q-s, and B, i, j), although, at an early time point, expression 
of MEIS1, MEIS2 and HOXA11 is reduced and HOXA13 expression 
was virtually undetectable in both grafted and shielded X-irradiated 
limbs (Fig. 2A and B; data not shown). This was probably due to a 
developmental delay as normal forelimb HOXA13 expression initi- 
ates after that of HOXAI1 (ref. 13). We also found that hindlimb 
HOXA13 expression was not disrupted after irradiation, consistent 
with HOXA13 hindlimb expression preceding its expression in the 
forelimb (data not shown). After 48h post-irradiation, MEIS1, 
MEIS2, HOXA11 and HOXA13 expression returned and demarcated 
the three limb segments. To confirm that there are three distinct 
proximodistal domains in irradiated limbs, we performed double 
WISH with MEIS1 and HOXA13. Similar to contralateral controls, 
three distinct domains were identifiable in irradiated limbs: a distal 
HOXA13 domain, an unstained middle region and a proximal MEIS1 
domain (Fig. 2B, e, f, i, j). 

Our data, suggesting that limb segment specification is not affected 
by X-irradiation, raise the question of why this treatment causes a 
preferential loss of proximal structures. We reasoned that this could 
be explained if X-irradiation led to changes in apoptosis, prolifera- 
tion or vasculature disproportionately in the proximal limb. 
X-irradiation causes both apoptosis and cell cycle arrest’. 
Consistent with this, we see an increase in apoptosis, using TdT- 
mediated dUTP nick end labelling (TUNEL) at 3 and 24h post- 
irradiation (Fig. 3A and Supplementary Figs 3 and 4), as well as a 
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Figure 2 | Expression analysis of irradiated limbs reveals defects in 
differentiation but not specification of the proximal segments. A, In 
irradiated limb grafts, MEIS1, HOXA11, HOXA13 and SOX9 expression is 
reduced at 24h post-irradiation (h.p.i.; i, arrowheads) compared with 
controls (a—d). At later time points, expression demarcates three limb 
segments (m-o; q-s, brackets and MEIS1 expression in inset of A, q) as in 
controls (e-g, brackets). SOX9 expression is found distally (p, t, arrows) 
compared with controls (d, h). (Grafted limbs are located in the anterior 
margin of host limbs.) B, At 24 h.p.i., irradiated RFLs displayed reduced 


i 
(mbes 
stele tes 
Hats 
oe ee 


LEL: 


Irradiated 


° 


MEIS1 HOXA11 HOXA13 


expression of MEIS1 (brown), HOXA13 (purple; b; 91%, n = 11) and SOX9 
(d; 100%, n = 6) compared with LFLs (a, c). At 48 h.p.i., MEIS1 and 
HOXA13 expression in irradiated RFLs (f, j, arrowheads; 100%, n = 12) was 
similar to that in LFLs (e, i, arrowheads). SOX9 was only expressed distally in 
irradiated RFLs (100%, n = 9) at 48 h.p.i. (h, arrows). k and I were stained at 
96 h.p.i. (u, ulna; d, digit). C, At stage 23 (left), MEISI (red) is proximal; 
HOXA11 (grey) is distal and partially overlapping with HOXA13 (purple). 
After stage 28 (right), expression demarcates three segments (Pr, proximal; 
D, distal; A, anterior; P, posterior). 
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Figure 3 | Changes in cell death and proliferation do not explain the loss of 
proximal structures. A, An increase in TUNEL-positive cells in irradiated 
RFLs from shielded embryos (d, e) and irradiated grafts (h) compared with 
unirradiated controls (a, b, g). Levels are relatively normal at 48 h.p.i. 

(c, f, i). Autofluorescence in g and i is due to blood cells. B, A decrease in 


56% decrease in mitotic cells at 3h post-irradiation by phosphory- 
lated histone H3 (pH3) staining (Fig. 3B and Supplementary Figs 3 
and 4). At later stages, neither apoptosis nor proliferation rates differ 
from contralateral and unirradiated controls or from control and 
irradiated grafts (Fig. 3A, c, f, i, and B, b, d, f, and Supplementary 
Fig. 3). Similar results were reported in previous analyses’. 
Importantly, however, neither the cell death nor the proliferative 
changes we observed were localized to or enriched in proximal 
domains of the limb. Similarly, there do not appear to be any differ- 
ential changes in the vasculature along the proximodistal axis (data 
not shown). Thus, these effects are not likely to be responsible for the 
specific failure of the proximal structures to form. 

We next examined whether X-irradiation differentially affects dif- 
ferentiation along the proximodistal limb axis. In the limb, con- 
densation of the skeletal elements occurs progressively, with 
proximal elements condensing before distal ones. SOX9 is an early 
marker of condensing mesenchyme and is necessary for cartilage and 
bone formation’. SOX9 expression is first detected in the limb bud at 
stage 21, concurrent with or just after irradiation. We find that, at 
24h post-irradiation, SOX9 expression is markedly reduced or lost in 
irradiated limb buds compared with unirradiated and contralateral 
controls (Fig. 2A, d, 1, and B, c, d). At 48h post-irradiation, SOX9 
expression is detected, but, notably, only in more distal regions of 
irradiated limb buds where condensation is initiated at later stages 
(Fig. 2A, h, p, t, and B, g, h, k, 1). Thus, this finding suggests a model in 
which X-irradiation depletes the number of cells throughout the 
mesenchyme. This depletion has dire consequences for the proximal 
tissue, which is in the early stages of chondrogenic condensation, 
and, asa result of this loss, cannot form skeletal structures. The failure 
of the proximal cells to condense is probably attributed to insufficient 
numbers of chondrocyte precursors. More distal tissue, in contrast, 
has time to recover before the proximal-to-distal wave of differenti- 
ation reaches it. A similar model has been proposed to explain seg- 
ment loss in FGF mutants’®’”. Interestingly, the idea of a threshold 
number of cells required to form skeletal elements was proposed as an 
explanation as to why structures that have already been specified were 
affected by increasing doses of irradiation’. 

If this model is correct, irradiation at early stages affects the forma- 
tion of proximal skeletal elements because cells in that domain ini- 
tiate condensation first and, hence, undergo this process 
concurrently with the cellular response to irradiation damage. 
According to this view, we reasoned that irradiation at later stages 


402 


NATURE] Vol 460/16 July 2009 


Control 


pH3 
Irradiated 


Control 


Graft 


pH3-positive cells at 3 h.p.i. in irradiated RFLs (c) compared to controls 
(a). By 24h.p.i., pH3 staining of irradiated RFLs (b) and grafts (f) is similar 
to that in controls (b, e). Sections are longitudinal except for h and f, which 
are frontal. 


should selectively affect the later-condensing distal elements. We 
irradiated the limbs of shielded embryos at E4.5 (stage 24) and exam- 
ined their skeletal elements at E8. The irradiated limbs had severely 
shortened zeugopods whereas other limb segments were minimally 
affected (see arrow in f in Fig. 4A). As expected, there was an initial 
reduction in proliferation and an increase in apoptosis throughout 
the mesenchyme of the late-irradiated limbs (Supplementary Fig. 5a; 
data not shown). Corresponding to the reduction in zeugopod size, 
SOX9 expression was abnormal and reduced in the middle segment 
of irradiated limb specimens despite the presence of HOXAI1 
expression (see arrow in b and e in Fig. 4A). 

We saw preferentially distal effects when we irradiated later E5.5 
(stage 26) limb buds. Most strikingly, a complete loss of the autopod 
was observed in these irradiated limbs (see arrow in fin Fig. 4B). The 
proximal limb cartilage had fewer TUNEL-positive cells, consistent 
with our finding that the stylopod and zeugopod are only slightly 
affected in these limbs and in agreement with the idea that differen- 
tiated cartilage is resistant to irradiation-induced cell death'® 
(Supplementary Fig. 5b). SOX9 expression was virtually lost in the 
handplate at 24 and 48h post-irradiation, in spite of continued 
HOXA13 expression (see b and arrow in e in Fig. 4B; not shown). 
Taken together, these results support the idea that X-irradiation at 
successively later stages sequentially affects more distal limb seg- 
ments, where chondrogenic condensation is occurring, and, as in 
the earlier stage irradiations, segment specification is not affected. 

These conclusions depend on the use of markers which, although 
congruent with the limb segments, may not be involved in their 
specification. Therefore, we tested our conclusions in a second 
way. According to the progress zone model where proximal tissue 
and, indeed, the entire limb bud is respecified as distal after irra- 
diation, all surviving mesenchymal cells of the limb bud, including 
the proximal ones, should contribute to the distal skeletal structures 
that form. In contrast, if specification is unaffected, but skeletal ele- 
ments fail to differentiate from proximal mesenchyme, then prox- 
imal tissues should not contribute to the distal elements forming after 
irradiation. 

To follow the fates of proximal tissue after X-irradiation, we 
injected the lipophilic dye Dil approximately 300m from the 
AER in control and irradiated stage 19-20 limb buds. Consistent 
with the results of previous fate maps'’, injection at this location in 
unirradiated limb buds labels the stylopod segment (arrowheads in a, 
d, g in Fig. 4C). In contrast, Dil injected proximally in irradiated 
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Figure 4 | X-irradiation disrupts chondrogenesis and does not convert 
proximal cells to distal fates. A, Expression of SOX9 (100%, n = 8) and 
HOXAII1 (100%, n = 6), and skeletal preparations of stage-24 irradiated 
RELs (b, e, f, arrows) and LFLs (a, c, d). B, Expression of HOXA13 (100%, 
n = 3) and SOX9 (100%, n = 3) and Alcian Blue staining of stage-26 
irradiated RFLs (b, e, f; arrows) and LFLs (a, ¢, d). C, Dil (red fluorescence) 
labels the humerus in controls (a, d, g, arrowheads; 100%, n = 5). In 
irradiated RFLs, Dil did not expand distally (b, e, h, arrowheads; 100%, 
n=A4Aand ¢, f, i, arrowheads; 100%, n = 2). Scale bar, 1 mm; h, humerus; r, 
radius; u, ulna; d, digits. 
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limbs had limited expansion and did not contribute to the distal 
skeletal elements that formed (arrowheads in b, c, e, f, h, i in 
Fig. 4C). Another lipophilic dye, DiO, however, when injected dist- 
ally within 50 ttm of the AER in irradiated limb buds, did expand into 
the distal regions (Supplementary Fig. 6). This result supports our 
hypothesis that the loss of proximal skeletal elements is due to failure 
of those segment progenitors to condense and not because the cells 
occupy the progress zone for longer periods and are respecified to 
distal fates. 

Our data, in conjunction with earlier studies reevaluating the effect 
of AER removal”®, eliminate the major experimental support for the 
progress zone model. Although our study does not address the mech- 
anism by which proximodistal specification does occur, it is consist- 
ent with models in which specification is driven by the influence of 
traditional signalling centres'*”' rather than by time spent within a 
progress zone. Most importantly, our data indicate that phocomelia 
caused by X-irradiation results from a loss of prechondrogenic pro- 
genitors and a consequent failure in differentiation, rather than from 
a defect in proximodistal patterning. 
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Increased cell death has been thought to underlie thalidomide- 
induced limb truncations in chick embryos, but whether this is a 
result of direct activation of caspase pathways’, or an indirect 
result of angiogenic inhibition, remains unclear™*. Although thalid- 
omide treatment in chick causes distal truncations, the effects of 
thalidomide in humans on predominantly proximal segments like- 
wise have been suggested to be due to cell death. Indeed, cell death has 
been linked to phocomelia in other contexts of teratogenic exposure, 
such as with nitrogen mustard**”®, and the phocomelia observed in 
these experiments has been interpreted mainly as a patterning 
defect*°. However, our results indicate that in these cases, as in the 
irradiation experiments, cell death may lead to phocomelia not by 
producing a smaller limb bud in the context of a progress zone but by 
eliminating chondrogenic precursors during a time window when 
proximal condensation is compromised but distal differentiation has 
not yet commenced. 


METHODS SUMMARY 


Eggs were obtained, maintained and incubated as described”®. In ovo irradiations 
were performed at the Hamburger Hamilton stages indicated. For the grafted 
limb experiments, unirradiated and irradiated stage 19-21 limb buds were dis- 
sected and grafted to the anterior wing border of a stage 24-25 host embryo as 
described? to overcome the subsequent death of the embryo. In ovo irradiations 
were performed with eggshell shielding with a 2-3 mm diameter hole, exposing 
the REL. This protected the embryo and the blood islands from damage while the 
REL was irradiated. 

WISH and Alcian Blue and Alizarin Red skeletal preparations were performed 
as described’”’*. Chick probes were MEIS1 and MEIS2 (ref. 21), HOXA11 and 
HOXA13 (ref. 13), FGF8 (a gift from J. C. Izpisua Belmonte), SHH”? and SOX9 
(ref. 30). AP-2 (ChEST765g1) was linearized with NotI and an antisense probe 
was generated using T3. For each gene expression, irradiation dose and time 
period, we analysed at least two irradiated grafted specimens (both irradiated leg 
and wing grafts were used) and at least three irradiated limbs from shielded 
embryos. In situ cell death detection kit, TMR red (Roche) and ApoTag fluor- 
escein in situ detection kit (Chemicon Int.) were used for TUNEL. pH3 immuno- 
histochemistry was performed with anti-pH3 (1:200, Millipore) and anti-rabbit 
AlexaFluor 594 (1:200, Invitrogen) or biotinylated secondary antibody. All 
images of irradiated RFLs and LFLs are at the same magnification. 

The lipophilic dyes Dil and/or DiO (Invitrogen) were prepared and injected 
into limb buds 1-2h after irradiation as described'’””®. After photographing, 
limbs were stained to visualize the cartilage and skeletal elements and were 
compared with the previous fluorescent images at the same magnification. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Irradiation. For the grafted limb experiments, a Xylon International Smart 200-E 
irradiator at 200 kV was used at a dose rate of 2.2 grays (Gy) per min at a height of 
18cm. Three different doses were used (low, 3.6; intermediate, 5.5; high, 7.25 Gy), 
which were selected according to the resulting phenotypes. For the in ovo irradia- 
tions, the RFL bud was exposed to 17.5—20 Gy (1,750—2,000 rads) in a Faxitron 
43855D irradiator at 120 kV and a height of 30.5 cm. This dose range consistently 
produced phocomelia in RFLs and never in LFLs. Some irradiated specimens lack 
anterior (see lower limbs in a, b in Fig. 1B) and posterior (see lower limb in c in 
Fig. 1B) digits. On a few occasions, the contralateral LFLs had mild phenotypes, 
including abnormally shaped skeletal elements (Fig. 1B, asterisk; humerus in 
upper limb of a) or fusion at the elbow (Fig. 1B, asterisk; upper limb in c), but 
never lost proximal elements. Whereas the phenotypes produced by shielding 
were more variable than the grafting method (probably due to variation in shield 
placement), we consistently saw phocomelia phenotypes in the affected limbs. 
Irradiated limbs from shielded embryos were only collected and used for in situ 
hybridizations, TUNEL or pH3 analysis if they were morphologically smaller and 
appeared to be affected compared with the contralateral LFLs. 

Whole-mount in situ hybridizations and skeletal preparations. After overnight 
fixation in 4% paraformaldehyde/PBS and subsequent dehydration in methanol, 
in situ hybridizations were performed as described”’. DIG-labelled probes were 
detected with BCIP (5-bromo-4-chloro-3’-indolyphosphate p-toluidine salt) and 
NBT (nitro-blue tetrazolium chloride) (Sigma) and FITC-labelled probes were 
detected with INT/BCIP (Sigma). Cartilage and bone staining were performed 
after collection at E7—-9.5 using Alcian Blue and Alizarin Red as described”*. 
Analysis of cell death, proliferation and the vasculature. Limbs were fixed in 
4% PFA for 1-2h, dehydrated in ethanol, cleared in xylene and embedded in 
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paraffin, and 7-10-u1m alternate sections were collected and used for TUNEL 
staining and pH3 immunohistochemistry. TUNEL staining was done using the 
in situ cell death detection kit, TMR red (Roche) or ApoTag fluorescein in situ 
detection kit (Chemicon Int.). For pH3-staining, antigen retrieval was followed 
sequentially by incubation with anti-pH3 (1:200, Millipore) and anti-rabbit 
AlexaFluor 594 (1:200, Invitrogen) or biotinylated secondary antibody followed 
by incubation with ABC solution (Vector Laboratories) and development in 
diaminobenzidine. Slides for TUNEL and anti-pH3 were counterstained with 
DAPI to visualize the nucleus. In some irradiated-shielded specimens there were 
more apoptotic cells on the dorsal side closest to irradiation exposure (upper 
portion of limb in panel e in Fig. 3A). The number of TUNEL and pH3-positive 
cells were counted in irradiated RFLs, contralateral LFLs and unirradiated limbs 
from shielded embryos at 3h post-irradiation, and this number was divided by 
the total pixel area of the limb for Supplementary Fig. 4. When calculating the 
pixel area, all images were at the same magnification and resolution. For both sets 
of data, two sections from at least four different irradiated specimens were 
counted. 

Using an Eppendorf pressure injector (model 5242), approximately 3 ul of 

Alexa Fluor 488 acetylated LDL (Invitrogen) was directly injected into the cir- 
culation of control and irradiated embryos. Acetylated LDL binds and is inter- 
nalized by endothelial cells and macrophages and it was used to visualize changes 
in the vasculature at several time points after irradiation. 
Lineage tracing. The lipophilic dyes Dil and/or DiO (Invitrogen) were injected 
into stage 19-20 limb buds 1-2 h after irradiation as described using a calibrated 
reticle'”’°. Embryos were allowed to develop for 4—5 days before collection and 
imaging for fluorescent signals on a Nikon Eclipse E1000 with Y-2 E/c Texas Red 
(Dil) and fluorescein isothiocyanate/HYQ (DiO) filters. 
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The AP-1 transcription factor Batf controls T,,17 


differentiation 


Barbara U. Schraml'*, Kai Hildner’?*, Wataru Ise’*, Wan-Ling Lee’, Whitney A.-E. Smith’, Ben Solomon’, 
Gurmukh Sahota’, Julia Sim*, Ryuta Mukasa°, Saso Cemerski', Robin D. Hatton’, Gary D. Stormo’, 
Casey T. Weaver”, John H. Russell’, Theresa L. Murphy’ & Kenneth M. Murphy’? 


Activator protein 1 (AP-1, also known as JUN) transcription fac- 
tors are dimers of JUN, FOS, MAF and activating transcription 
factor (ATF) family proteins characterized by basic region and 
leucine zipper domains'. Many AP-1 proteins contain defined 
transcriptional activation domains, but BATF and the closely 
related BATF3 (refs 2, 3) contain only a basic region and leucine 
zipper, and are considered to be inhibitors of AP-1 activity’ *. Here 
we show that Batfis required for the differentiation of IL17-pro- 
ducing T helper (T};17) cells’. Ty17 cells comprise a CD4* T-cell 
subset that coordinates inflammatory responses in host defence 
but is pathogenic in autoimmunity’*’. Batf_‘~ mice have normal 
Ty and T},2 differentiation, but show a defect in T}17 differenti- 
ation, and are resistant to experimental autoimmune encephalo- 
myelitis. Batf ‘~ T cells fail to induce known factors required for 
Ty17 differentiation, such as RORyt"' (encoded by Rorc) and the 
cytokine IL21 (refs 14-17). Neither the addition of IL21 nor the 
overexpression of ROR}t fully restores IL17 production in Batf 
T cells. The [117 promoter is BATF-responsive, and after T}17 
differentiation, BATF binds conserved intergenic elements in 
the I117a-Il17flocus and to the 117, 1121 and [122 (ref. 18) promo- 
ters. These results demonstrate that the AP-1 protein BATF has a 
critical role in T}17 differentiation. 

In a gene expression survey (Supplementary Fig. 1a), we identified 
the basic leucine zipper transcription factor ATF-like’ Batf to be 
highly expressed in Ty1, Ty2 and Ty17 cells compared to naive T 
and B cells. BATF and BATF3 (refs 2, 3) form heterodimers with 
JUN®” and are considered to be repressors of AP-1 activity’?**””. 
To assess the role of BATF in T cell differentiation’, we generated 
Batf ’~ mice (Supplementary Fig. 2a, b). Batf ‘~ mice lacked detect- 
able BATF protein, were fertile and appeared healthy. BATF protein 
was low in naive T cells, increased in T},2 cells, induced by activation 
(Supplementary Fig. 2), present in the nucleus and cytoplasm, but 
showed increased nuclear translocation after activation (Fig. la and 
Supplementary Fig. 1b, c). Batf ’~ mice had normal thymus, spleen 
and lymph node development, and normal CD4* and CD8™ T cell 
development (Supplementary Figs 3, 4a, b). Although Batftransgenic 
mice had altered natural killer T cell development”', Batf ’~ mice had 
normal development of natural killer T cells (Supplementary Fig. 4c), 
B cells (Supplementary Fig. 4d, e), and conventional and plasma- 
cytoid dendritic cells (Supplementary Fig. 5a, b). 

Batf~ ‘~ T cells showed normal Ty] and T,42 differentiation 
(Supplementary Fig. 6a). Under Ty17 conditions, Batf ‘~ T cells, 
but not Batf'’~ T cells, showed a marked reduction in IL17 produc- 
tion, but had normal levels of IL2, IFN-y and IL10 (Fig. 1b, c). 


Batf ‘~ DO11.10* T cells showed loss of IL17 even after several 
passages under Ty17 conditions (Supplementary Fig. 6b). Batf 
CD8" T cells also failed to produce IL17 (Supplementary Fig. 6c). We 
generated transgenic mice expressing Flag-tagged Batf under the 
control of the Cd2 promoter”. Batf-transgenic DO11.10* CD4* T 
cells and CD8* T cells had increased IL17 production under Ty17 
conditions compared to controls (Supplementary Fig. 6d, e). Lamina 
propria CD4* T cells, which constitutively express IL17 in wild-type 
mice'', failed to produce IL17 in Batf ‘" mice (Supplementary 
Fig. 6f). 

Ty17 cells are the major pathogenic population in experimental 
autoimmune encephalomyelitis’® (EAE), although factors other than 
IL17A and IL17EF can contribute to the disease”’. Batf''* mice immu- 
nized with myelin oligodendrocyte glycoprotein peptide 35-55 
(MOG(35-55); Fig. 2) developed EAE, but Batf /— mice were com- 
pletely resistant (Fig. 2a). At peak disease, central nervous system 
(CNS)-infiltrating and splenic CD4* T cells from Batf*'* mice pro- 
duced abundant IL17 and IFN-y, whereas T cells from Batf~ ‘~ mice 
did not produce IL17 (Fig. 2b and Supplementary Fig. 7a). Because 
IL6-deficient mice are resistant to EAE owing to a compensatory 
increase in FOXP3~ T regulatory (Teg) cells'*, we analysed splenic 
Batf'’* and Batf ‘~ CD4* T cells for FOXP3 expression before and 
after MOG(35-55) immunization (Supplementary Fig. 7b, c). 
Batf ‘~ mice had lower basal numbers of splenic FOXP3~ T cells 
compared to Batf'’* mice, but showed no change in FOXP3* 
expression after MOG(35-55) immunization (Supplementary Fig. 
7b, c), suggesting that their resistance to EAE is not due to an increase 
in Tyeg cells. To determine whether the resistance to EAE in Batf— a 
mice resulted from a defect within T cells or other immune cells, we 
injected naive Batf*’* CD4* T cells or PBS control buffer into mice 
before MOG(35-55) immunization (Fig. 2c). Batf~ / mice receiving 
PBS remained resistant to EAE, but Batf ’~ mice receiving naive 
Batf'’* CD4* T cells developed severe EAE (Fig. 2c and Supplemen- 
tary Table 1) with CNS-infiltrating IL17-producing CD4™~ T cells 
(Supplementary Fig. 7d). Thus, Batf ’~ mice have a T-cell-intrinsic 
defect preventing EAE. 

Batf could control Ty17 development by regulating IL6 or TGF-B 
signalling. IL6 receptor expression and IL6-induced STAT3 phos- 
phorylation were normal in Batf ’~ T cells (Supplementary Fig. 8a, 
b). TGF-B induced normal levels of FOXP3 in Batf ‘~ CD4* T cells 
(Supplementary Fig. 8d). Whereas Batf ‘~ T cells failed to fully 
downregulate FOXP3 in response to IL6 (ref. 12), neutralization of 
IL2 abrogated increased FOXP3 in Batf ’~ T cells, without restoring 
IL17 production (Supplementary Fig. 8d, e). Thus, Batf ‘~ T cells 


Department of Pathology and Immunology, “Howard Hughes Medical Institute, Department of Genetics, Department of Molecular Biology and Pharmacology, Washington 
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Figure 1| Loss of IL17 production in Batf “~ T cells. a, DO11.10* CD4* T 
cells from CD2-N-Flag-Batf transgenic mice or littermate controls were 
cultured with ovalbumin (OVA) and antigen-presenting cells (APCs) under 
Ty2 conditions for 7 days, and stained with antibodies to CD4 and Flag. 
DAPI, 4,6-diamidino-2-phenylindole; DIC, differential interference 
contrast; iono, ionomycin; TG, transgenic, unstim, unstimulated; WT, wild 
type. b, Batf'’* and Batf ‘ CD4* CD62L* CD25. T cells cultured under 
Ty17 conditions were restimulated with PMA and ionomycin on days 7 (left 
panel) or 3 (middle and right panels), and stained for IL17, IFN-y, IL2 and 
IL10. c, IL17 and IFN-y expression in DO11.10* CD4* T cells from Batf‘’*, 
Batf ‘'~ and Batf ‘~ mice activated with OVA and APCs under Ty17 
conditions. Data are representative of at least two independent experiments. 


exhibit normal TGF-B signalling and proximal IL6 signalling, indi- 
cating Batf may regulate downstream target genes. 

IL21, an early target of IL6 signalling in CD4* T cells’, is required 
for Ty17 development'*"®. IL21 was reduced in Batf ‘~ CD4* T cells 
activated under Ty17 conditions (Fig. 3a). The addition of IL21 
failed to rescue Tj;17 development in Batf ’~ T cells (Fig. 3b), but 
IL21-induced STAT3 phosphorylation was intact (Supplementary 
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Figure 2 | Batf-’~ mice are resistant to EAE. a, Batf’’* (n = 12) and 
Batf~ ‘~ (n = 13) mice were immunized with MOG(33-35) peptide. Mean 
clinical EAE scores + s.e.m are shown, and are representative of two 
independent experiments. b, Thirteen days after EAE induction, CNS- 
infiltrating lymphocytes were stimulated with PMA and ionomycin, gated 
on CD4* cells, and stained for intracellular IL17 and IFN-y (clinical scores 
are in parentheses, data are representative of 2-3 mice per group). ¢, Batf'’* 
and Batf ’~ mice were injected with control PBS buffer (n = 5) or with 
1X 10’ Batf*’* CD4* T cells (n = 6) 4 days before EAE induction. Mean 
clinical scores are shown. 


Fig. 8c), suggesting that Batf regulates other factors besides IL21 
during Ty17 differentiation. 

We performed DNA microarrays and quantitative PCR with 
reverse transcription (qRT-PCR) of Batf*’* and Batf ‘~ T cells 
activated with combinations of IL6 and/or TGF-B (Fig. 3c, d and 
Supplementary Fig. 9). This analysis identified several genes known 
to regulate Ty17 development as Batf-dependent (Fig. 3c, d, 
Supplementary Fig. 9c and Supplementary Table 2), including 
Rorc'’, Rora™, the aryl hydrocarbon receptor (Ahr)””°, [122 (ref. 
18) and [/17. However, Irf4 (ref. 13) and suppressor of cytokine 
signalling (SocsI-7) gene expression were unchanged in Batf ’~ T 
cells (Supplementary Fig. 9b and Supplementary Table 4). Early 
induction of RORyt was normal in Batf ’~ T cells, but was not 
maintained 62h after stimulation (Supplementary Fig. lla). Batf 
seemed to be necessary for the expression of a subset of IL6-induced 
genes, but was not required for the expression of TGF-B-induced 
genes (Fig. 3c, Supplementary Fig. 9a and Supplementary Tables 2 
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Figure 3 | BATF controls several T,,17-associated genes. a, IL21 expression 
in Batf'’* or Batf ‘~ T cells cultured under Ty17 conditions determined by 
qRT-PCR and ELISA. The mean and s.d. are shown, from 3 mice. b, IL17 and 
IFN-y expression of CD4* CD62L* CD25" T cells cultured as in a in the 
presence or absence of IL21. ¢, Microarray analysis of anti-CD3/CD28- 
activated T cells at 72 h, presented as heat maps of genes fivefold-induced in 
Batf‘’* T cells under Ty17 conditions. d, IL17 and IL22 expression in 
Batf‘’* or Batf ’~ CD4* T cells activated under Ty;17 conditions for 3 days. 
e, Anti-CD3/CD28-activated Batf'’* or Batf ’~ CD4* T cells were left 
uninfected or infected with retrovirus (RV) RORyt-GFP-RV or with the 
control empty-GFP-RV, and stained for IL17. GFP, green fluorescent 
protein. 


and 3). However, Batf did not globally affect IL6-induced responses, 
because IL6-induced liver acute phase responses appeared normal in 
Batf ‘~ mice (Supplementary Fig. 10). 

Because RORyt acts directly on the Jl17 promoter’’*, we 
addressed whether RORyt could rescue Ty17 development in 
Batf ‘~ T cells. In Batf'’* T cells, retroviral RORyt expression 
induced 38% IL17 production, compared to only 1.6% IL17 produc- 
tion induced by control retrovirus (Fig. 3e and Supplementary 
Fig. 11c)!"°. However, in Batf ’~ T cells, retroviral RORyt expres- 
sion induced only 5.7% IL17 production (Fig. 3e and Supplementary 
Fig. llc). Even under T},17-inducing conditions, retroviral RORyt 
expression did not fully restore IL17 production in Batf ‘~ T cells 
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(Supplementary Fig. 11b, c). Retroviral expression of both BATF and 
RORyt in Batf ‘~ T cells induced 26% IL17 production, compared to 
only 5% with RORyt alone and 14% with BATF alone 
(Supplementary Fig. 11d), suggesting that there is potential synergy 
between RORyt and BATFE, and a possible direct action of BATF in 
transcription of [117 and other Ty17-specific genes. 

We used a reverse-strand retroviral reporter” to examine [117 
promoter activity in primary Batf‘/" and Batf ‘~ T cells (Fig. 4a). 
Three days after activation, Batf ’- CD4* T cells showed consid- 
erably less reporter activity than Batf’’* T cells, suggesting that the 
proximal J/17 promoter is Batf-responsive (Fig. 4a). Using chromatin 
immunoprecipitation (ChIP) analysis of several conserved regions 
within the [117a—Il17flocus (Supplementary Fig. 12a), we found that 
BATFE specifically bound to the +9.6 kilobase (kb) and +28 kb inter- 
genic regions within 24 h after activation (Fig. 4b and Supplementary 
Fig. 12b, c). By day5 after stimulation, BATF bound specifically to 
several intergenic regions and to the proximal J/17a and Il17f pro- 
moters (Fig. 4b and Supplementary Fig. 12b, c), with distal elements 
showing more rapid and stronger binding than proximal elements. 
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Figure 4 | BATF directly regulates IL17 expression. a, Bat; “* and Batf~ = 
CD4* T cells cultured under T}17 conditions were infected with hCD4-pA- 
GFP-RV-IL17p reporter virus. GFP expression after PMA and ionomycin 
restimulation is shown. b, Batf'’* and Batf ‘~ CD4* T cells cultured under 
Ty17 conditions for 5 days were subjected to ChIP analysis of the indicated 
regions using an anti-BATF antibody (mean and s.d.). ce, EMSA supershift 
analysis of Tj;17 whole cell extracts using a consensus AP-1 (c, e) or the IL17 
(—155 to —187) (d) probe. Bat / (wild type, WT), Batf ’~ (knockout, 
KO), CD2-N-Flag-Batf transgenic (TG), IL17 (—155 to — 187) and retinoic- 
acid-receptor-related orphan receptor (ROR) response element (RORE) 
probes were used as competitors. Anti-JUN denotes antibodies with 
pan-JUN specificity. 
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We next examined BATF binding to a consensus AP-1 probe® by 
electrophoretic mobility shift assays (EMSAs). This probe formed 
two complexes in Batf'’' Ty17 cell extracts (Fig. 4c) that were 
dependent on stimulation (Supplementary Fig. 13a). Only the upper 
complex formed in Batf ‘~ Ty17 cells (Fig. 4c). An anti-BATF anti- 
body inhibited the lower complex. In CD2-N-Flag- Batf-transgenic 
Ty17 cell extracts, the lower complex was specifically supershifted by 
an anti-Flag antibody (Fig. 4c). Thus, only the lower complex binding 
the consensus AP-1 probe in Ty;17 cells contains BATF. 

Several potential BATF-binding sites were identified by EMSAs in 
the 1117, 1121 and [122 proximal promoters, including the I/17 pro- 
moter region (— 188 to —210) that bound BATF in ChIP (Fig. 4b and 
Supplementary Fig. 13b—d). Another BATF-binding 117 promoter 
region (—155 to — 187) overlapped with a reported RORyt-binding 
element”’. As an EMSA probe, this region forms two complexes in 
Ty17 cells (Fig. 4d), with the lower complex being selectively inhib- 
ited by an anti-BATF antibody, absent in Batf ’~ T,17 cells, and 
supershifted by an anti-Flag antibody in Batf-transgenic T}17 
extracts (Fig. 4d). We confirmed BATF binding to the [/21 and [122 
promoters by ChIP analysis (Supplementary Fig. 13e). The program 
CONSENSUS” determined that the BATF-binding element in the 
II17, 1121 and [122 promoters resembles canonical AP-1 elements at 
positions 1 to 3, with variation at the remaining nucleotides 
(Supplementary Fig. 13f). CONSENSUS did not identify other tran- 
scription-factor-binding sites enriched near BATF-binding elements. 
We determined the composition of the BATF-containing complex 
using supershift analysis (Fig. 4e). The upper complex supershifted 
with a pan-anti-FOS antibody, whereas the lower complex super- 
shifted with a pan-anti-JUN and anti-BATF antibodies. Anti-JUNB 
supershifted most of the lower complex, but antibodies against JUN 
(also known as c-JUN), JUND, ATF1 or ATF3 did not. Thus, BATF 
preferentially forms heterodimers with JUNB during Ty17 differ- 
entiation. 

Although Batfand Batf3 were considered to be AP-1 inhibitors**, 
we have shown that they are required for the development of specific 
immune lineages’. Batfis selectively required for Ty17 development, 
but unlike Irf4 (ref. 13), it is not required for Ty2 development. 
Because Batf is also expressed in Ty1 and Ty2 cells, it probably 
cooperates with other T};17-specific factors to regulate target genes. 
Future work will determine whether the actions of BATF involve 
distinct DNA binding specificity or unique protein—protein interac- 
tions with T},17 specific factors. 


METHODS SUMMARY 

Mice. Batf ’ mice were generated by homologous recombination, deleting 
exons | and 2 of the Batf gene on the pure 129SvEv genetic background. The 
neomycin-resistance cassette was removed from the targeted Baff allele in 
embryonic stem cells before the generation of mice. 

T cell differentiation assays. Naive CD4* CD62L* CD25" T cells (also known 
as CD4* SELL” IL2RA_ ) were isolated by cell sorting and activated with plate- 
bound anti-CD3 (also known as CD3E) and soluble anti-CD28 antibodies. 
Cultures were supplemented with anti-IL4 (11B11; hybridoma supernatant), 
IFN-y (Peprotech; 0.1 ng ml ') and IL12 (Genetics Institute; 10 U ml ') for 
Tul; anti-IFN-y (H22; gift from R.D. Schreiber; 10 pg ml '), anti-IL12 (Tosh; 
BioXcell; 10 pg ml!) and IL4 (Peprotech; 10 ng ml!) for Ty2; anti-IL4, anti- 
IL12, anti-IFN-y, IL6 (Peprotech; 20ng ml!) and TGF-B (Peprotech; 
0.5ngml_') for T;17 differentiation. Unless otherwise indicated, 3 days after 
activation cells were restimulated with phorbol myristate acetate (PMA) and 
ionomycin for 4h for intracellular cytokine analysis by flow cytometry. 
Intracellular staining. For intracellular cytokine staining, cells were stained for 
surface markers followed by fixation with 2% formaldehyde for 15 min at room 
temperature. Cells were then washed once in 0.05% saponin and stained with 
anti-cytokine antibodies in 0.5% saponin. Anti-phospho-STAT3 antibody (BD 
Pharmingen) was used according to the manufacturer’s recommendations. In 
brief, cells were stained for surface markers followed by fixation with 90% meth- 
anol at —20°C overnight. Cells were then washed and stained for phosphory- 
lated-STAT3 in PBS containing 3% FCS. FOXP3 staining was performed 
according to the manufacturer’s recommendations using FOXP3 staining buf- 
fers (eBioscience). 


408 


NATURE|Vol 460|16 July 2009 


Induction of EAE. Mice (7-10 weeks old) were immunized subcutaneously with 
100 pg MOG(35-55) peptide (Sigma) emulsified in complete Freund’s adjuvant 
(incomplete Freund’s adjuvant supplemented with 500 ug Mycobacterium 
tuberculosis). One and three days later, mice were given 300 ng Pertussis Toxin 
(List Biological Laboratories) intraperitoneally. Clinical scores were assessed 
as described in Methods. For T-cell transfer experiments, mice were injected 
with either PBS or 10’ Batf'/* CD4* T cells 4days before MOG(35-55) 
immunization”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 9 April; accepted 5 May 2009. 
Published online 5 July 2009. 


1. Wagner, E. F. & Eferl, R. Fos/AP-1 proteins in bone and the immune system. 
Immunol. Rev. 208, 126-140 (2005). 

2. Hildner, K. et al. Batf3 deficiency reveals a critical role for CD8a* dendritic cells in 
cytotoxic T cell immunity. Science 322, 1097-1100 (2008). 

3. lacobelli, M., Wachsman, W. & McGuire, K. L. Repression of IL-2 promoter 
activity by the novel basic leucine zipper p21°\*" protein. J. Immunol. 165, 
860-868 (2000). 

4. Blank, V. Small Maf proteins in mammalian gene control: mere dimerization 
partners or dynamic transcriptional regulators? J. Mol. Biol. 376, 913-925 (2008). 

5. Williams, K. L. et al. Characterization of murine BATF: a negative regulator of 
activator protein-1 activity in the thymus. Eur. J. Immunol. 31, 1620-1627 (2001). 

6.  Echlin, D.R., Tae, H. J., Mitin, N. & Taparowsky, E. J. B-ATF functions as a negative 
regulator of AP-1 mediated transcription and blocks cellular transformation by 
Ras and Fos. Oncogene 19, 1752-1763 (2000). 

7. Dorsey, M. J. et al. B-ATF: a novel human bZIP protein that associates with 
members of the AP-1 transcription factor family. Oncogene 11, 2255-2265 (1995). 

8. Thornton, T. M., Zullo, A. J., Williams, K. L. & Taparowsky, E. J. Direct 
manipulation of activator protein-1 controls thymocyte proliferation in vitro. Eur. J. 
Immunol. 36, 160-169 (2006). 

9. Harrington, L. E. et al. Interleukin 17-producing CD4* effector T cells develop viaa 

ineage distinct from the T helper type 1 and 2 lineages. Nature Immunol. 6, 

123-1132 (2005). 

O. Langrish, C. L. et al. IL-23 drives a pathogenic T cell population that induces 

autoimmune inflammation. J. Exp. Med. 201, 233-240 (2005). 

1. Ivanov, I. |. et al. The orphan nuclear receptor RORyt directs the differentiation 

program of proinflammatory IL-17* T helper cells. Cel! 126, 1121-1133 (2006). 

2. Bettelli, E. et al. Reciprocal developmental pathways for the generation of 

pathogenic effector Ty17 and regulatory T cells. Nature 441, 235-238 (2006). 

3. Brustle, A. et al. The development of inflammatory T,417 cells requires interferon- 

regulatory factor 4. Nature Immunol. 8, 958-966 (2007). 

4. Korn, T. et al. IL-21 initiates an alternative pathway to induce proinflammatory 
Ty17 cells. Nature 448, 484-487 (2007). 

5. Nurieva, R. et al. Essential autocrine regulation by IL-21 in the generation of 
inflammatory T cells. Nature 448, 480-483 (2007). 

6. Wei,L., Laurence, A., Elias, K. M. & O'Shea, J. J. IL-21 is produced by Th17 cells and 
drives IL-17 production in a STAT3-dependent manner. J. Biol. Chem. 282, 
34605-34610 (2007). 

7. Zhou, L. et al. IL-6 programs Ty17 cell differentiation by promoting sequential 
engagement of the IL-21 and IL-23 pathways. Nature Immunol. 8, 967-974 (2007). 

8. Liang, S.C. et al. Interleukin (IL)-22 and IL-17 are coexpressed by Th17 cells and 

cooperatively enhance expression of antimicrobial peptides. J. Exp. Med. 203, 

2271-2279 (2006). 

9. Bower, K. E., Fritz, J. M. & McGuire, K. L. Transcriptional repression of MMP-1 by 

p27°“FT and reduced in vitro invasiveness of hepatocarcinoma cells. Oncogene 23, 

8805-8814 (2004). 

20. Hess, J., Angel, P. & Schorpp-Kistner, M. AP-1 subunits: quarrel and harmony 

among siblings. J. Cell Sci. 117, 5965-5973 (2004). 

21. Williams, K. L. et al. BATF transgenic mice reveal a role for activator protein-1 in 

KT cell development. J. Immunol. 170, 2417-2426 (2003). 

22. Zhumabekov, T., Corbella, P., Tolaini, M. & Kioussis, D. Improved version of a 

human CD2 minigene based vector for T cell-specific expression in transgenic 

mice. J. Immunol. Methods 185, 133-140 (1995). 

23. Haak, S. et al. IL-17A and IL-17F do not contribute vitally to autoimmune neuro- 
inflammation in mice. J. Clin. Invest. 119, 61-69 (2009). 

24. Yang, X. O. et al. T helper 17 lineage differentiation is programmed by orphan 
nuclear receptors ROR and RORY. Immunity 28, 29-39 (2008). 

25. Veldhoen, M. et al. The aryl hydrocarbon receptor links T417-cell-mediated 
autoimmunity to environmental toxins. Nature 453, 106-109 (2008). 

26. Quintana, F. J. et al. Control of T,eg and T}17 cell differentiation by the aryl 
hydrocarbon receptor. Nature 453, 65-71 (2008). 

27. \chiyama, K. et al. Foxp3 inhibits RORyt-mediated IL-17A mRNA transcription 
through direct interaction with RORyt. J. Biol. Chem. 283, 17003-17008 (2008). 

28. Zhang, F., Meng, G. & Strober, W. Interactions among the transcription factors 
Runx1, RORyt and Foxp3 regulate the differentiation of interleukin 17-producing T 
cells. Nature Immunol. 9, 1297-1306 (2008). 

29. Zhu, H. et al. Unexpected characteristics of the IFN-y reporters in nontransformed 
T cells. J. Immunol. 167, 855-865 (2001). 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE| Vol 460|16 July 2009 


30. Hertz, G. Z.& Stormo, G. D. Identifying DNA and protein patterns with statistically 
significant alignments of multiple sequences. Bioinformatics 15, 563-577 (1999). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank R. Lallone for anti-BATF antibody preparation, and 
B. Sleckman for Cre-expressing adenovirus. This work was supported by the 
Howard Hughes Medical Institute (K.M.M.), and grants from the National 
Institutes of Health HGO0249 and training grant GMO7200 (G.D.S.), Al035783 
(C.T.W.), ARO49293 (R.D.H.), and from Daiichi-Sankyo Co. Ltd (C.T.W.). 


Author Contributions B.U.S. generated Batf “~ mice, designed and analysed the 
experiments, interpreted results and wrote the manuscript. K.H. constructed the 


LETTERS 


targeting vector and probes, transgenic vector, and recombinant BATF. W.I. helped 
with retroviral expression experiments. W.-L.L. helped with reverse-strand 
reporter analysis. W.A.-E.S. helped with mouse generation. B.S. helped with EMSA 
analysis. G.S. and G.D.S. performed bioinformatics analysis for the BATF binding 
elements. J.S. and J.H.R. helped with EAE experiments. R.M., R.D.H. and C.T.W. 
performed ChIP experiments. T.L.M. and S.C. performed confocal microscopy for 
BATF. K.M.M. directed the study and wrote the manuscript. 


Author Information Microarray data are available at Array Express (http:// 
www.ebi.ac.uk/array express/) under the accession numbers E-MEXP-1518, 
E-MEXP-2152 and E-MEXP-2153. Reprints and permissions information is available 
at www.nature.com/reprints. Correspondence and requests for materials should 
be addressed to K.M.M (kmurphy@wustl.edu). 


409 


©2009 Macmillan Publishers Limited. All rights reserved 


doi:10.1038/nature08114 


METHODS 

Generation of Batf ~ mice. Murine Batf exons 1-2 were deleted by homolog- 
ous recombination via a targeting vector constructed in pLNTK”! using a 1 kb 
genomic fragment (left arm) upstream of the Batfexon 1, and a 3.6 kb genomic 
fragment (right arm) downstream of exon 2. The left arm was generated by PCR 
from genomic DNA with the use of the oligonucleotides: left arm forward, 
5'-ATTACTCGAGTGAAACAAACAGGCAGTCGCAGTG-3’; left arm reverse, 
5'-ATTACTCGAGCCTACTACCTTTCAGGGCTACTGC-3’ (bold nucleotides 
indicate Xhol restriction-enzyme sites). The right arm was generated by PCR 
with the use of the oligonucleotides: right arm forward, 5'-ATTAGTCGAC- 
GCATTCTTCATGGTCCTTAGCCTTGG-3’; right arm reverse, 5’-ATTA- 
GTCGACCAGAGAATGAGAAATGTTGGAGG-3’ (bold nucleotides indicate 
Sall restriction-enzyme sites). EDJ22 embryonic stem cells were transfected with 
linearized targeting vector and targeted clones were identified by Southern blot 
analysis using probes A and B located 5’ to the left arm and 3’ to the right arm, 
respectively. Probe A was generated using the oligonucleotides 5’- 
CAACTGGGTCTGAGTCAAGAGGT-3' and 5’-CGTAGCCGCTGATTGTTT- 
TAGAAC-3’ to generate a 531-bp product. Probe B was generated using 
the oligonucleotides 5'-ACAGCTTGAACTTCAGAGCCCTCC-3’ and 5’- 
CACATTTAAGTCACAATAACACTGC-3’ to generate a 772-bp product. The 
neomycin-resistance cassette was deleted from successfully targeted clones by in 
vitro treatment with Adeno-Cre virus (gift from B. Sleckman, Washington 
University), and targeted clones with successful neo deletion were identified 
by Southern blot using probes A and B (Supplementary Fig. 2a, b). Blastocyst 
injections were performed with two distinct recombinant clones, each of which 
generated germline transmission of the targeted Batfallele. Male chimaeras were 
crossed with 129SvEv females to establish Batf mutants on the pure 129SvEv 
genetic background. All experiments were performed with mice containing the 
neo-deleted mutant allele. Homozygous mice were obtained by intercrossing 
heterozygous siblings, and littermates were used as controls in most experiments. 
For some experiments 129SvEv wild-type mice purchased from Taconic served 
as controls. For experiments with DO11.10 transgenic Batf ‘~ mice, mice were 
crossed to BALB/c mice for at least five generations, and littermates were used as 
control. 

For the generation of transgenic mice, Batf complementary DNA was cloned 
from CD4* T cell messenger RNA using primers 5’-GGAAGATTAGAACCAT- 
GCCTC-3’ and 5'-AGAAGGTCAGGGCTGGAAG-3’, and subcloned into the 
GFP-RV retrovirus’. An amino-terminal Flag tag was introduced by Quick 
Change Mutagenesis kit (Stratagene) using the primers 5’-GGACTACA- 
AAGACGATGACGACAAGCCTCACAGCTCCGACAGCA-3’ and 5'-CTTG- 
TCGTCATCGTCTTTGTAGTCCATGGTTCTAATCTTCCAGATC-3’. — The 
underlined sequence indicates nucleotides used to introduce the Flag-tag. The 
Flag-tagged Batfwas cloned into the CD2 microinjection cassette” via blunt-end 
strategy into a Smal-digested CD2 microinjection cassette. Transgene expres- 
sion in CD4* T cells was tested by anti-Flag western blot. CD2-N-Flag-Batf 
transgenic mice were crossed to C57BL/6 and BALB/c mice for at least five 
generations. Transgene-negative littermates were used as control mice. Mice 
were bred and maintained at the animal facilities at Washington University. 
All animal experiments were approved by the Animal Studies Committee at 
Washington University. 

Visualization of lymph nodes. To visualize superficial inguinal lymph nodes, 
mice were injected with 50 ul of 1% Evans Blue dye solution into each hind foot 
pad. After 1.5h mice were euthanized and lymph nodes were visualized using a 
dissecting microscope™. 

Western blot analysis. To test for residual BATF protein expression, total sple- 
nocytes from Batf*/* and Batf~/~ 129SvEv mice were stimulated with anti-CD3 
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for 3 days under T}17 conditions. Cells were then lysed in RIPA buffer, electro- 
phoresed on 15% polyacrylamide gels, transferred to nitrocellulose, and analysed 
by western blot with rabbit anti-murine BATF polyclonal serum and HRP-con- 
jugated anti-rabbit immunoglobulin antibody (Jackson ImmunoResearch). 
Affinity purified rabbit anti-murine BATF polyclonal serum (Brookwood 
Biomedical) was generated by immunization with full-length recombinant 
BATF protein. Equal protein loading was assessed by subsequent immunoblot- 
ting with antibody to B-actin (Santa Cruz Biotechnology) and HRP-conjugated 
anti-mouse antibody (Jackson ImmunoResearch). 

For analysis of BATF protein expression in naive CD4* T cells, CD4* T cells 
from Batf*!* and Batf '~ 129SvEv mice were magnetically purified. Equal cell 
numbers were lysed in RIPA buffer and subjected to western blot analysis as 
described above. 

For analysis of BATF expression in T;,2 cells, magnetically purified CD4* T 
cells from Batf*/* and Batf~/~ mice were activated with anti-CD3/CD28 in the 
presence of IL4, anti-IL12 (Tosh) and anti-IFN-y (H22). On day 4, cells were left 
unstimulated or stimulated with PMA and ionomycin for 4h. Cells were col- 
lected by centrifugation, washed with PBS, and resuspended (100 X 10° 
cellsml-') in Affymetrix Chip lysis buffer (10 mM Tris, pH 7.5, 10mM NaCl, 
3mM MgCl, 0.5% IGEPAL, with protease inhibitors (PMSF, aprotinin, leupep- 
tin)). After 5 min at 4 °C, nuclei were collected by centrifugation (800g for 3 min, 
4°C) and lysed in RIPA (100 X 10° cell equivalents per ml) with protease inhi- 
bitors. Nuclear lysates were centrifuged for 10 min at 4 °C 15,000g, diluted with 
an equal volume of 2X SDS-PAGE sample buffer containing 2-mercaptoeth- 
anol, and extracts from equal cell numbers were subjected to western blot ana- 
lysis using rabbit anti-murine BATF polyclonal serum. Equal protein loading 
was assessed by subsequent immunoblotting with antibody to Lamin B 
(Santa Cruz Biotechnology) and HRP-conjugated anti-goat Ig (Jackson 
ImmunoResearch). 

Immunohistochemistry. CD4* T cells from CD2-N-Flag-Batf transgenic mice 
were isolated by magnetic separation and either left untreated or stimulated with 
PMA and ionomycin for 4h. Cells were then allowed to settle on poly-t-lysine- 
treated slides, fixed with 4% formaldehyde, permeabilized with 0.25% 
Triton X-100, and stained with an anti-Flag antibody (M2, Sigma Aldrich) 
according to the manufacturer’s recommendations. A goat anti-mouse AF-488 
(Invitrogen) antibody was used to detect anti-Flag staining. For analysis of 
cellular localization of BATF in Ty,2 cells, DO11.10 CD4* T cells from CD2- 
N-Flag-Batf transgenic mice were isolated and differentiated with OVA and 
APCs under T}2 conditions for 7 days. On day 7, cells were either left untreated 
or stimulated with PMA and ionomycin for 4 h. Cells were stained with anti-Flag 
antibody as described earlier. Cells were also stained with anti-CD4 APC anti- 
body (BD Biosciences). Confocal images were obtained with the Olympus 
FV1000 microscope and software using a X60 oil objective. The pinhole was 
set to 110 41m. The excitation/emission settings used for DAPI, Alexa 488 and 
Alexa 633 were 405/461 nm, 488/520 nm and 635/668 nm, respectively. 

Further methods can be found in the Supplementary Information. 
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Cohesins form chromosomal cis-interactions at the 
developmentally regulated IFNG locus 


Suzana Hadjur’, Luke M. Williams’, Natalie K. Ryan’, Bradley S. Cobb!, Tom Sexton’, Peter Fraser’, 


Amanda G. Fisher’ & Matthias Merkenschlager’ 


Cohesin-mediated sister chromatid cohesion is essential for chro- 
mosome segregation and post-replicative DNA repair’. In addi- 
tion, evidence from model organisms** and from human 
genetics’ suggests that cohesin is involved in the control of gene 
expression®”. This non-canonical role has recently been rationalized 
by the findings that mammalian cohesin complexes are recruited to 
a subset of DNase I hypersensitive sites and to conserved noncoding 
sequences by the DNA-binding protein CTCF'*’*. CTCF functions 
at insulators (which control interactions between enhancers and 
promoters) and at boundary elements (which demarcate regions 
of distinct chromatin structure)'*, and cohesin contributes to its 
enhancer-blocking activity’®'’. The underlying mechanisms remain 
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Figure 1| Developmentally regulated cohesin and CTCF binding at IFNG. 
a, ChIP and genomic-tiling array data for the mouse B3 pre-B cell line'° show 
log2 enrichment of the cohesin subunit RAD21 (black) and CTCF (grey) at 
Ifng. Schematic representation of the mouse Ifng region (m chromosome 
(chr.) 10) and the human JFNG region (h chr.12), the red flash symbol 
indicates genomic rearrangement at [126, a pseudogene in rodents”’. b, ChIP 
and real-time PCR mapping of RAD21 (top) and CTCF (bottom) at human 
IFNG in 293T cells (white), non-polarized CD4 T cells (grey), CCR5~ Ty1 
(red) and CRTH2* T;2 (black) effector memory cells. *P < 0.05, 

**P < 0.01, ***P < 0.005; Student’s t-test. Primer positions are indicated 
relative to the IFNG transcription start site. Human/mouse identity at the 
—63 kb site is 59% over 291 base pairs (bp) surrounding a canonical CTCF 


unknown, and the full spectrum of cohesin functions remains to be 
determined. Here we show that cohesin forms the topological and 
mechanistic basis for cell-type-specific long-range chromosomal 
interactions in cis at the developmentally regulated cytokine locus 
IFNG. Hence, the ability of cohesin to constrain chromosome topo- 
logy is used not only for the purpose of sister chromatid cohesion’”, 
but also to dynamically define the spatial conformation of specific 
loci. This new aspect of cohesin function is probably important for 
normal development** and disease’. 

One model of CTCF function is the formation of chromatin loops, 
which have been demonstrated at the H19/IGF2, B-globin and major 
histocompatibility complex (MHC) class IT loci’*"’”. Whether cohesin 
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consensus motif. Two evolutionarily conserved regions (http:// 
www.dcode.org) are found at +1.5 kb (70.2% and 69.1% identity over 104 
and 217 bp), and three at +119 kb (70.0%, 71.0% and 62.2 and 69.1% 
identity over 120, 100 and 45 bp). Positive control sites at chr.11: 118,283 kb, 
chr.11: 118,333 kb, chr.8: 24,891 kb and CD8 cluster I were included in all 
experiments (Supplementary Fig. 1), and data are normalized to chr.11: 
118,283 kb (mean and-s.d., n = 4). ¢, Isolation (left) of human effector 
memory CCR5* Ty1 and CRTH2* Ty2 cells and cytokine expression were 
assessed by intracellular staining and flow cytometry (middle) or real-time 
RT-PCR analysis (right) before (contr.) and after activation with plate 
bound anti-CD3 and anti-CD28 (activ.). 
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controls chromosome conformation is important for understanding 
its contribution to insulator and boundary element function”, and for 
determining the significance of cohesin binding to genomic locations 
that are unlikely to constitute insulators or chromatin boundaries, 
including sites within the coding regions of active genes'®"’. 

Cohesin binding within a developmentally regulated gene is exem- 
plified by IFNG, which encodes the cytokine interferon-y (IFN-y). A 
conserved cohesin site is located within the first intron of the mouse 
Ifng and the human IFNG coding region’®. The Ifng locus contains 
several conserved noncoding sequences and putative cis-regulatory 
elements that are located at considerable distances from the coding 
region’’*' and can function as enhancers or insulators’. Genomic- 
tiling arrays'’® showed major binding sites for the cohesin subunit 
RAD21 and for CTCF at —73 and +66kilobases (kb) in mouse 
lymphoid cell lines (Fig. la). Chromatin immunoprecipitation 
(ChIP) and real-time PCR analysis of the human 293T cell line and 
of primary CD4 T cells sorted from human peripheral blood demon- 
strated strong binding of cohesin and CTCF at orthologous 
sequences located 63 kb upstream of IFNG, and at evolutionarily 
conserved regions (http://www.decode.org/) at +1.5 and +119 kb 
in the human IFNG region (Fig. 1b, see legend for details). Analysis 
of genome-wide ChIP-sequencing data confirmed —63, +1.5 and 
+119 kb as the major CTCF sites in human CD4 T cells, and showed 
that features of canonical CTCF sites, namely the histone H2A variant 
H2AZ (also known as H2AFZ) and trimethylated histone H3 lysine 4 
(H3K4me3) were present at —63 and +1.5kb (data not shown). 

Naive, non-polarized CD4 T cells are the progenitors of specialized 
T helper (Ty) cell types, including Ty1 cells (which confer immunity 
to intracellular pathogens) and Ty,2 cells (which aid antibody pro- 
duction and mediate responses to helminths). Ty cell differentiation 
has become a model for how genetically identical cells acquire dis- 
tinct gene expression programs by the interplay of transcription fac- 
tors and chromatin regulators*’. The IFNG locus is prepared for 
expression during the differentiation of T}y1 cells: DNA methylation 
declines”, cell-type-specific hypersensitive sites emerge’””' and 
IFNG transcription becomes inducible”. Conversely, IFNG is 
silenced during T};2 differentiation”’. To gain insight into the develop- 
mental regulation of cohesin and CTCF at the human IFNG locus 
we isolated Ty1 and Ty2 effector memory cells (Fig. 1c). These cells 
arise in vivo, can be identified by the expression of the chemokine 
receptor CCR5 and the prostaglandin D2 receptor CRTH2 (also 
known as CD294 or GPR44), respectively*’, and retain their cytokine 
profiles when expanded in vitro under non-polarizing conditions”. 
Real-time PCR with reverse-transcription (RT-PCR) showed that 
IFNG transcripts were highly induced after activation of CCR5* 
Tul cells (Fig. 1c), and intracellular staining showed the inducible 
expression of IFN-y protein in CCR5* Ty cells, whereas CRTH2* 
Ty2 cells expressed interleukin-4 (IL4, Fig. 1c). The positioning of 
CTCE and cohesin across IFNG was similar between non-polarized 
CD4 T cells, CCR5* Ty! cells, CRTH2* Ty2 cells and non-lymphoid 
293T cells (Fig. 1b), consistent with the reported preference of CTCF 
and cohesin for constitutive hypersensitive sites'’®. Interestingly, 
however, both cohesin and CTCF were significantly more abundant 
at the —63, +1.5 and +119kb sites in CCR5~ Ty cells (Fig. 1b). 
Increased cohesin and CTCF binding in CCR5* Ty] cells was selective 
for the IFNG locus, because CCR5* Tyyl and CRTH2* Ty;2 cells 
expressed similar levels of RAD21 and CTCF (Supplementary 
Fig. la), and the occupancy of control sites outside the LFNG region 
was equivalent (Supplementary Fig. 1b). 

Analysis of activating and repressive post-translational histone 
modifications across the IFNG region suggested boundary element 
function for the —63 site, but not for the +1.5 or the +119 kb sites 
in non-polarized human CD4 Tcells*. We therefore explored 
whether these sites have additional functions in organizing the 
IFNG region in three-dimensional nuclear space by chromosome 
conformation capture?” (3C) experiments on CCR5* Tyl and 
CRTH2* Ty;2 chromatin templates (Supplementary Fig. 2). The 
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inducible nature of IFNG expression in CCR5* Ty] cells allowed us 
to define the chromosome topology of the locus in its inducible state 
and independently of changes in locus conformation that may accom- 
pany high-level transcription. Using a HindIII restriction fragment 
containing the intronic + 1.5 kb site as a bait, we looked for long-range 
interactions with sites across the IFNG genomic region. Crosslinking 
frequencies fell with increasing distance from the bait, but were sig- 
nificantly increased for the CTCF and cohesin sites at —63 and 
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Figure 2 | Cell-type-specific long-range chromosomal interactions at IFNG 
are based on CTCF/cohesin sites. a, Chromosome conformation capture 
(3C) and Taqman PCR were used to determine interactions between IFNG 
+1.5 kb (beacon) and primers (black arrowheads) placed within the 
indicated HindIII fragments (shaded blue) in non-polarized CD4 T cells 
(grey), CCR5~ Ty1 (red) and CRTH2* Ty2 (black) cells (Supplementary 
Fig. 2 and Supplementary Table 1). Crosslinking efficiencies are expressed 
relative to +119 kb in CCR5 cells and normalized to mouse Acta2 (see 
Methods and Supplementary Fig. 2) or to human ACTA2 with similar results 
(Supplementary Fig. 3a). Cohesin sites and the approximate positions of the 
IFNG and IL26 genes are indicated (mean + s.d., n = 5 for CCR5 and 
CRTH2, n = 4 for non-polarized CD4 T cells). P< 0.005; Student’s t-test, 
Ty versus non-polarized, and Ty;1 versus T}2. b, 3D-FISH to determine the 
nuclear diameters (top) of CCR5* Ty (red, 9.4 + 1.1 jum, n = 50) and 
CRTH2* Ty2 (black, 9.5 = 1.5 jum, m = 50) cells, and the distances between 
IFNG alleles (bottom) in individual CCR5 (red, 4.5 + 1.6 um, n = 50, 
minimum 1.5 um) and CRTH2 (black, 5.0 + 1.9 um, n = 50, minimum 

1.1 um) cells. ¢, Long-range interactions at IFNG in CCR5* Ty] cells occur 
before DNA replication. 3C analysis of IFNG in total CCR5~ Tyy1 cells (red) 
compared to CCR5* Ty] cells in the G1 phase of the cell cycle (orange). Data 
are presented as in a (meanands.e.m., n = 2). 
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+119 kb, indicating long-range interactions at IFNG that occur selec- 
tively in CCR5* Ty] cells (Fig. 2a). Nuclear dimensions were similar 
for CCR5* Tyl and CRTH2™ Ty2 cells (Fig. 2b), which rules out 
spatial constraints as an explanation for increased crosslinking effi- 
ciencies in CCR5* Ty] cells. Because the IFNG loci on homologous 
chromosomes were separated by at least 1.5 1m in three-dimensional 
fluorescence in situ hybridization (3D-FISH) experiments (Fig. 2b), 
and long-range interactions in the IFNG genomic region were readily 
detectable in purified G1 cells (that is, before DNA replication and the 
formation of sister chromatids, Fig. 2c), the interactions recorded by 
3C probably occurred in cis, not in trans between IFNG alleles. These 
data establish that CTCF and cohesin sites form the topological basis 
for cell-type-specific, long-range interactions at IFNG that occur pre- 
ferentially in human CCR5* Ty cells. 

We next explored the relationship of CTCF and cohesin with long- 
range chromosomal interactions. Depletion of CTCF by short interfer- 
ing RNA (siRNA)-mediated knockdown in CCR5* Ty] cells resulted 
in reduced 3C interactions between the IFNG locus (+ 1.5 kb) and the 
—63 and +119kb sites (Supplementary Fig. 4), reminiscent of the role 
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for CTCF in long-range interactions at other loci’*’’. Importantly, 


RAD21 knockdown substantially reduced long-range interactions of 
the IFNG coding region with the —63 and the +119 kb sites (Fig. 3a), 
demonstrating that the conformation of the locus depended on cohesin 
as well as CTCF. ChIP analysis of the IFNG region in CCR5~ Ty] cells 
showed that RAD21 RNA interference (RNAi) reduced the binding of 
RAD21 to its sites at IFNG, whereas CTCF remained bound (Fig. 3b). 
This result indicates that CTCF cannot maintain long-range interac- 
tions at IFNG in the absence of cohesin. Control experiments 
established that RAD21 knockdown did not interfere with technical 
aspects of the 3C methodology (Supplementary Fig. 2) or with cell cycle 
progression of human CCR5* Tyl cells (Supplementary Fig. 5). 
Notably, basal INFG transcript levels were reduced by half (50 + 13% 
(mean + s.d.), 7 = 5, P< 0.005, one-sided t-test) in RAD21-depleted 
CCRS5* Ty1 cells (Fig. 3c), and the inducibility of INFG transcripts 
after T-cell activation was reduced to 76 + 12% ofcontrol levels (Fig. 3c, 
n= 4, P<0.05). These effects were selective because the inducibility of 
IL2 was not compromised (115 + 30%, n = 3), and the expression of 
IGF2, which is restricted to the paternal allele by cohesin’’, was 
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Figure 3 | Long-range chromosomal interactions at IFNG require cohesin. 
a, CCR5* Tyl cells were transfected with control (red) or RAD21 (light 
blue) siRNA oligonucleotides. Long-range interactions at IFNG were 
assessed by 3C (mean = s.e.m., n = 4 for —63, —38 and +119 kb, n = 2 for 

98, —28, +42, +78 and +142 kb). Data are expressed relative to +119 kb 
in control-siRNA-treated CCR5* cells after normalization to mouse Acta2 
in admixed mouse chromatin, which also served to control for technical 
aspects of the 3C methodology (Supplementary Fig. 2). Normalization to 
human ACTA2 gave similar results (Supplementary Fig. 3b). **P < 0.05, 
***P < 0.005; Student’s t-test. The expression of CTCF and RAD21 protein 
(right) was assessed by immunoblotting 72 h after transfection of CCR5~ 
Ty] cells with RAD21 or control siRNA. Lamin is shown as a control. b, ChIP 
for RAD21 (top) and CTCF (bottom) occupancy of IFNG 72h after 
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transfection of CCR5* Ty1 cells with control (red) or RAD21 siRNA (light 
blue). Primer positions are indicated in kb (mean and s.e.m., n = 2). ¢, Real- 
time RT-PCR analysis of CCR5~ Tyl cells 72h after transfection with 
RAD21 siRNA or control siRNA. Top panel, RAD21 and CTCF are shown as 
knockdown controls (mean and s.d., n = 5). Middle panels, levels of IEFNG 
are shown 1 week (basal, n = 5) or 4h (induced, n = 4) after T-cell activation 
with anti-CD3 and anti-CD28 antibodies. IL2 is shown as an activation 
control (n = 3) and IGF2 as a known cohesin target (n = 3). Data are 
normalized to GAPDH, HPRT and TBP expression and shown relative to 
control-siRNA-transfected CCR5 cells. Bottom panels, enzyme-linked 
immunosorbent assay (ELISA) for basal (n = 4) and activation-induced 

(n = 4) IFN-y protein secretion. All data in ¢ are mean and s.d. 
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increased (247 + 148%, n= 3, P<0.05) in RAD21-depleted CCR5* 
Ty! cells (Fig. 3c). The basal and inducible expression of IFN-y protein 
were also reduced (to 48 + 9%, n= 4, P<0.05, and 57+7%, n= 4, 
P<0.05) by RAD21 knockdown (Fig. 3c). Hence, cohesin is required 
for the CCR5* Ty] cell-specific conformation of the IFNG locus and 
contributes to its regulation. 

Taken together, these data show that the remodelling of the devel- 
opmentally regulated IFNG locus during the differentiation of naive 
CD4 T cells into Ty] cells is accompanied by the enhanced asso- 
ciation of CTCF and cohesin with a select set of conserved sequence 
elements. CTCF and cohesin—perhaps in collaboration with other 
factors or modifications—mediate T};1-specific long-range interac- 
tions between these sites. Recent data provide support for an ‘archi- 
tectural role’ of CTCF/cohesin sites in chromosome conformation”’. 
Our data show that long-range interactions mediated by cohesin can 
be dynamically regulated in a developmental context, rather than 
static in nature. An attractive hypothesis is that CTCF recruits 
cohesin to specific sites’? '’, and that cohesin in turn mediates 
chromosomal interactions between these sites in cis (this study). 
This new aspect of cohesin function is probably of general impor- 
tance in defining local and global genome topology. Cohesin- 
mediated chromosomal cis-interactions provide one possible 
mechanism for the impact of cohesin on the regulation of H19/ 
IGF2 (refs 9, 11 and J.-M. Peters and A. Murrell, personal commun- 
ication), of cytokine loci such as IL4, IL5, IL13 and IFNG”, MHC 
class II genes’” and of other loci””’ that depend on CTCF/cohesin for 
their appropriate expression. We speculate that by constraining the 
physical conformation of chromatin, cohesin may affect the 
probability with which gene regulatory elements interact with each 
other, both negatively and positively, in normal development*” and 
in disease’. 


METHODS SUMMARY 


Naive human CD4 T cells, CCR5* Ty] and CRTH2* Ty2 effector memory cells 
were isolated by flow cytometry as described*’. ChIP and real-time PCR, RNA 
isolation, RT-PCR, siRNA transfection (Amaxa) and immunoblotting have 
been described'®. As detailed in the full Methods, human T cells were mixed 
5:1 with mouse B3 pre-B cells'®, fixed, and isolated nuclei were digested with 
HindIII at >80% efficiency at each HindIII site tested. Religated restriction 
fragments were assayed by TaqMan PCR using custom dual-labelled fluorogenic 
probes (Sigma). 3C primers for mouse Acta2, human ACTA2 and IFNG were 
designed and tested as detailed in Supplementary Fig. 2. Data were normalized to 
mouse Acta2 or human ACTA2 as indicated. 3D-FISH and distance measure- 
ments with Volocity software (Perkin-Elmer/Improvision) have been 
described*’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell sorting and culture. The use of human cells was approved by the local 
National Health Service Trust Research Ethics Committee. CD4 T cells were 
enriched with CD4 microbeads (Miltenyi Biotech). Naive cbD4* CD45RA* 
(ImmunoTech, Beckman Coulter) and CCR5* Tyl (anti-CCR5, BD 
Biosciences) or CRTH2* Ty2 (anti-CD294, Miltenyi Biotech) effector memory 
cells were isolated by flow cytometry, activated with plate-bound anti-CD3 and 
anti-CD28 for 4days, and expanded in media containing IL2 as described’’. 
Naive cells were maintained in a non-polarized state by the addition of anti- 
IL4 and anti-IL12 (eBiosciences). Cytokine expression was induced with plate- 
bound anti-CD3 and anti-CD28, and evaluated by intracellular staining, real- 
time RT-PCR or ELISA (IFN-y Quantikine, R&D Systems). 

Real-time RT-PCR. RNA was isolated using RNAbee (Tel-Test) and reverse 
transcribed. PCR reactions included 2X SYBR PCR Master Mix (Qiagen), 
300 nM primers and 2 pl of complementary DNA as a template in 50 ul reaction 
volume. Cycle conditions were 94 °C for 8 min, 40 cycles of 94 °C for 30s, 55 °C for 
30s, 72 °C for 1 min, followed by plate read. All primers amplified specific cDNAs 
with at least 95% efficiency. Data were normalized against the average of three 
housekeeping genes, GAPDH, HPRT and TBP. Primer sequences were (5’-to-3’): 
GAPDH forward (fw) TCTGCTCCTCCTGTTCGACA, reverse (rev) AAAAG 
CAGCCCTGGTGACC; HPRT fw TCCTTGGTCAGGCAGTATAATCC, rev 
GTCAAGGGCATATCCTACAACAAA; TBP fw CCGGCTGTTTAACTTCGC 
TT, rev TTTTCCTAGAGCATCTCCAGCAC; IFNG fw GTTTTGGGTTCTCT 
TGGCTGTTA, rev ATTATCCGCTACATCTGAATGACCT; IL4 fw GCGATA 
TCACCTTACAGGAGATCA, rev GIGTTCTTGGAGGCAGCAAAG; IL2 fw 
AACACAGCTACAACTGGAGCATTT, rev AAATGTGAGCATCCTGGTGAG 
TTT; IGF2 fw ACACCCTCCAGTTCGTCTGT, rev CGGAAACAGCACTCCT 
CAAC; CTCF fw GGGGAAAATGGAGGAGAAAC, rev GCTCCTCCTCATCC 
TCATTG; RAD21 fw TGACTTTGATCAGCCACTGG, rev RT TCCCAACTTC 
TTICTCTCATGG. 

Chromatin immunoprecipitation. Formaldehyde (1%) was added to the 
culture medium at room temperature, blocked after 10 min with 0.125 M gly- 
cine, and cells were washed in cold PBS. Nuclear extracts were prepared and 
pellets were lysed in 150 mM NaCl, 25 mM Tris-HCl (pH 7.5), 5 mM EDTA, 1% 
Triton-X, 0.1% SDS, 0.5% deoxycholate for 30min on ice. Chromatin was 
sonicated to an average fragment size of 500 bp, centrifuged to pellet debris, 
incubated at 4°C with BSA- and salmon sperm-blocked magnetic PGS beads 


nature 


(Dynal) for 2h, and then overnight with PGS beads coated with rabbit anti- 
RAD21 (Abcam) or rabbit anti-CTCF (Upstate Biotech). The beads were 
sequentially washed for 5 min with low-salt buffer (0.1% SDS, 1% Triton-X, 
2mM EDTA, 20mM Tris-HCl, 150mM NaCl), high-salt buffer (0.1% SDS, 
1% Triton-X, 2mM EDTA, 20 mM Tris-HCl, 500 mM NaCl) and LiCl buffer 
(0.25 M LiCl, 1% NP-40, 1% deoxycholate, 1mM EDTA, 10 mM Tris-HCl). 
Chromatin was RNase treated and extracted in elution buffer (1% SDS, 0.1 M 
NaHCO3). DNA crosslinks were reversed at 65°C overnight, samples were 
phenol-chloroform extracted and assessed by real-time PCR. 

siRNA transfection and chromosome conformation capture. Human T cells 
were electroporated (Amaxa) with siRNA oligonucleotides and siGLO-red 
transfection indicator (Dharmacon) or siGLO-red alone using the Amaxa 
human T cell Nucleofector kit according to the manufacturer’s protocol 
(Amaxa Biosystems). Viable siGLO-red-positive cells were isolated by cell sort- 
ing and used for ChIP or 3C experiments. For 3C, 10” human T cells were mixed 
with 2 X 10° mouse B3 pre-B cells°. Formaldehyde (1%) was added to the culture 
medium at room temperature, blocked after 10 min with 0.125 M glycine, and 
the cells were washed with cold PBS before lysis in 10 mM Tris (pH 7.5), 10 mM 
NaCl, 5mM MgCl, 0.2% NP40 for 20 min on ice. Nuclei were washed and 
agitated at 37°C for 1h in 500 ul of 1.2X NEB buffer 2 with 0.3% SDS, and 
for 1h in 1.2 NEB buffer 2 with 2% Triton-X, and overnight with 1,500 units 
HindIII followed by 30 min at 65 °C in 1.5% SDS. Samples were diluted into 
1.1X T4 DNA ligase buffer (NEB) containing 1% Triton-X, and incubated at 
37°C for 1h. T4 DNA ligase was added (800 units) at 16°C for 4h. Crosslinks 
were reversed at 65 °C overnight and DNA was purified by phenol-chloroform 
extraction. Real-time PCR was performed in triplicate using TaqMan master mix 
(Qiagen) and custom dual-labelled fluorogenic probes (Sigma). 3C primers for 
human ACTA2 and IFNG, as well as mouse Acta2, were designed and tested as 
detailed in Supplementary Fig. 2, and 3C data were normalized to mouse Acta2 
or to human ACTA2 as indicated. For 3C analysis of cells in the G1 phase of the 
cell cycle, cells were fixed as described earlier and stained with 50g ml! 
propidium iodide in the presence of 0.05% NP40 and 1.5 mg ml | RNaseA for 
1h at room temperature and G1 cells were sorted by flow cytometry. Control 
cells were left unsorted. 

3D-FISH. 3D-FISH and distance measurements with Volocity software (Perkin- 
Elmer/Improvision) were performed as described*® using the IFNG BAC 
CTD2532A1 as a probe. 
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Untangling the protein web 


Researchers have identified thousands of macromolecular interactions within cells. But, as Nathan Blow 
finds out, joining them up in networks and figuring out how they work still poses a big challenge. 


In the spring of 2006, Andrew Emili and Jack 
Greenblatt from the University of Toronto 
in Canada and their colleagues published 
a survey’ of the global landscape of protein 
complexes within the yeast Saccharomyces 
cerevisiae in Nature. In the same issue, another 
group of researchers from the drug research 
company Cellzome in Heidelberg, Germany, 
also reported” on Saccharomyces protein com- 
plexes. “Those two data sets overlapped nicely, 
but by no means perfectly,” says Mike Tyers, 
a systems biologist at the University of Edin- 
burgh, UK. “And yet it was essentially the same 
method and same organism?” 

Greenblatt thinks that the two studies high- 
light something important that is emerging 
from the current crop of large-scale protein- 
protein interaction studies. “If you combine 
data sets you have more information than from 
any one study alone,’ he says. This is not to say 
that one such study is right and the other is 
wrong: scientists suspect it is more likely that 
one study often compensates for another's false 
negatives, revealing true protein interactions 
that can be missed during a single screen. 

“T think the interaction space is very large. 
Part of the issue is that there is a large range 
of interaction affinities, and as you start to get 
down into the weaker interactions those are 
tougher to detect,” says Tyers. He adds that 
identifying such “moving targets” is not like 
sequencing DNA, which can be argued to bea 
more stable target for researchers to aim at. 

But the jigsaw pieces are starting to pile up 
as researchers generate more and more genetic, 
metabolic and protein-interaction data sets 
using a diverse array of technologies. This 
work has been aided in recent 
years by a number of improved 
methods and techniques. Add 
to this recent refinements in 
computational tools for mod- 
elling signalling pathways and 
it’s clear that scientists might 
be on the cusp of changing the 


way they look at signalling and © 
information flow in cells. ® 
Embracing diversity wes 


“T think genetic information lays 
out the blueprints, whereas pro- 
teomics is much closer to what is 
going on in the cell, a molecular 
manifestation of a phenotype,” 
says Mike Snyder, a biologist at 
Yale University. When it comes 
to cataloguing proteins and 
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Mike Snyder has used protein arrays to explore 
the yeast interactome. 


their interactions, researchers are learning 
to embrace experimental diversity. “Every 
approach will usually give an overlapping but 
distinct set of information,’ says Snyder. “They 
all have their strengths and weaknesses.” 
Tyers and Greenblatt are in a growing group 
of investigators who are advancing the use of 
affinity-purification chromatography followed 
by mass spectrometry to uncover protein inter- 
actions in different cell types. In this approach, 
a protein of interest is tagged with a label that 
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Pathway maps illustrate the complexity of cellular interactions. 
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can be used for affinity purification. Although 
some scientists suspect weaker-interacting 
protein pairs or transient interactions could be 
lost during purification, Greenblatt — whose 
lab relies on tandem affinity purification tags 
in their purifications — says this is where the 
use of mass spectrometry helps out. “Mass 
spectrometry is very sensitive, so even if you 
lose 90% of the interactor during the affinity 
purification you can still detect the 10% that 
is left? he says. 

As with the technologies behind protein- 
protein analysis, researchers are finding that no 
single labelling tag may be enough to isolate all 
proteins. Tyers’s group recently reinterrogated 
a section of the yeast proteome using three 
different tags, each with different properties. 
“For a number of baits we queried, it made a 
difference what tag was on it,” he says. “Tags 
can certainly affect the recovery of interactions, 
consistent with the well-known genetic effects 
often caused by different tags.” 

Oncea specific protein or protein complex is 
purified, it is analysed with mass spectrometry. 
Electro-spray ionization or matrix-assisted 
laser desorption ionization (MALDI) volatil- 
izes and ionizes peptides, which are analysed 
on orthogonal or quadrupole time-of-flight 
(Q-TOF) instruments to identify ions with high 
mass-to-charge ratio values. Here research- 
ers have benefited greatly from advances by 
instrument developers. During the American 
Society for Mass Spectrometry annual confer- 
ence in Philadelphia, Pennsylvania, in June, 
Bruker Daltonics of Billerica, Massachusetts, 
announced its new ultrafleXtreme MALDI 
TOF/TOF system, and Thermo Fisher Scien- 
tific of Waltham, Massachusetts, 
introduced the LTQ Velos and 
LTQ Orbitrap Velos devices. 
Alongside other hardware, such 
as the Xevo Q-TOF from Waters 
in Milford, Massachusetts, and 
\) the 6500 series of Q-TOF instru- 


ee ments from Agilent Technologies 
nae in Santa Clara, California, these 
machines have improved both the 

neta 


dynamic range and sensitivity of 
mass analysis; in many cases they 
also feature integrated upstream 
separation technologies and 
© improved databases, all of which 
is making it easier to define a 
sample’s protein composition. 
For additional detail in the analy- 
sis, protein complexes can also be 
analysed with tandem mass spec- 
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trometry, in which selected precursor ions can 
be smashed into one another to produce still 
smaller fragments for analysis. 

“The big advantage of using mass 
spectrometry is that it can be per- 
formed in a physiological con- 
text? says Tyers. Unlike other 
methods for surveying protein- 
protein interactions, mass spectro- 
metry can be done on cell lines or 
even tissue samples, so indirect 
interactions that depend on more LA 
than two proteins or on post-  * “yo 
translational protein modifica- 
tions can be uncovered. Still, some 
researchers suggest that although affinity 
purification followed by mass spectrom- _, 
etry gives important information on how 
proteins interact in complexes, the approach 
does not reveal everything about the nature 
and mechanics of those interactions. 


Yeast shows the way 


hybrid assay, can provide different protein 
interaction information, according to Marc 
Vidal, a geneticist at the Dana—Farber Cancer 
Institute in Boston, Massachusetts. Vidal uses 
the analogy of two football teams facing each 
other with referees in the middle of the field to 
explain the differences between the techniques. 
“The pull-down mass-spectrometry approach 
will show you the players, referees and field, 
but not who is passing to whom and in what 
direction the ball is travelling,’ he says. “This 
is where a binary approach comes in.” 

The yeast two-hybrid assay is arguably the 
best-known binary approach. It relies on a 
split transcription factor in which one portion 
is placed on each of the two proteins being 
tested for interaction. If the proteins interact, 
the transcription factor will be regenerated and 
a reporter gene transcribed, providing a read- 
out. The assay allows for more the testing of 
dynamics of protein-protein interactions, such 
as dissociation rates. “Physical interactions 
are not everything: you need both edges and 
arrows to know the dissociation rates as well as 
other logical aspects of the relationships. Pull- 
down mass spectrometry is a little short when 
it comes to those interactions,’ says Vidal. 

The other advantage of the yeast two- 
hybrid approach is that it presents a more 
high-throughput solution to studying protein 
interactions. “The two-hybrid approach is rea- 
sonably high-throughput,’ says Snyder, noting 
that with robotics a large number of proteins 
can be tested for potential interactions in a 
two-by-two format. 

Other approaches have also been rising to 
the surface. “From the probing that we have 
done, we have picked up interactions that you 
definitely do not see with other methods,’ says 
Snyder of his experience using protein micro- 
arrays to explore protein-protein interactions. 
Protein arrays, which are sold by a number of 
companies including Invitrogen in Carlsbad, 
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Caenorhabditis elegans interactome map, showing 
Binary approaches, such as the yeast two- 5,500 protein interactions among 3,000 proteins. 


California, RayBiotech in Norcross, Georgia, 
and R&D Systems in Minneapolis, Minne- 
sota, have not been used as often for large- 
scale protein-interaction studies as either 
mass spectrometry or the binary-interaction 
approaches. “They have had impact in certain 
areas. Part of the problem is that they have 
been somewhat expensive, which might be the 
reason that they have not caught on as much 
for large-scale studies,” says Snyder. Given the 
potential of protein microarrays to identify 
unique interactions, he hopes that costs will 
fall, which could increase their use in large- 
scale interaction studies. 

An orthogonal approach to the yeast two- 
hybrid assay for detecting protein-protein 
interactions is the protein-fragment com- 
plementation assay (PCA), in which two 
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proteins of interest are attached to complemen- 
tary fragments ofa reporter protein. If the pro- 
teins interact with one another the reporter 
is regenerated providing a direct read-out 
that is not dependent on transcription 

of another gene as in the yeast-two- 

° hybrid assay. Steven Michnick and 
H» his colleagues used the PCA 
approach last year’ to explore 


* reo 
awe ¢ the yeast-protein interactome, 
2 ~.__,* identifying nearly 2,800 interac- 
a, 


tions among 1,124 proteins, many 
= of which had not previously been 
. _ identified by other approaches. 
Additional work and tools could be 
needed to define a complete interaction 
map for even the most well-characterized 
organisms. Snyder suspects that in yeast 
each protein ‘sees’ about five other pro- 


the interactions identified for yeast, which has 
around 6,000 proteins, add up to far fewer than 
the potentially 30,000 predicted. “So, there is 
still a way to go,’ he says. 


Clear pathways 
Finding which macromolecules interact is 
only the first step to figuring out signalling 
pathways. Researchers also need methods to 
assemble those interactions into cellular net- 
works, which is where bioinformatics enters 
the picture. “It is like building a bicycle — you 
have the wheels, a seat and handlebars, but we 
provide the steps to put the parts together,’ says 
Julie Bryant, vice-president of business devel- 
opment at GeneGo in St Joseph, Michigan, 
a company specializing in the development 
of software for cell-signalling and metabolic 
analysis. GeneGo is not alone here: a growing 
number of developers are creating tools for the 
analysis of signalling networks — from those 
that build model networks based on existing 
data to systems that use data sets and models 
to make predictions about the activity of dif- 
ferent signalling networks. 

“We can take in any kind of experimental 
data — genomic, proteomic, metabolomic — 
and overlay them on cell-signalling pathways,” 


Advances in mass spectrometry technology are benefiting protein-protein interaction studies. 
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teins on average. But at the moment all of 
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Different approaches for identifying protein-protein interactions often reveal unique information. 


says Bryant, describing GeneGo’s MetaCore 
software. Being able to overlay a variety of dif- 
ferent experimental data from different sources 
requires careful database curation, she says. At 
the moment, GeneGo employs 50 scientists to 
manually mine and curate published literature 
for studies on protein interaction, gene expres- 
sion, metabolism and drugs to expand and 
update its internal database, which now con- 
tains more than 120,000 multi-step interaction 


pathways, each averaging 11 steps, with infor- 
mation on direction, mechanism and feedback 
along the pathways, along with direct links to 
literature evidence. 

Literature mining is important for building 
larger interaction databases, but Bryant says it 
can be especially difficult if the experimental 
descriptions underlying the results have not 
been published. Another problem, according 
to Vidal, is that researchers sometimes have 


“sociological” biases in terms of which proteins 
and interactions they will work on and report. 
“We have learned a lot about the rules of how 
macromolecules interact, but when you ask 
how much of the network we have, or what the 
size of the interactome ofa particular species is, 
if you only used the literature it would be tough 
to answer those questions,” he says. 

Tyers is involved with the publicly funded 
BioGRID (Biological General Reposi- 
tory for Interaction Datasets) initiative, an 
internationally curated database of molecular 
interactions. Three years ago, there was an effort 
to back-curate all the yeast literature for protein 
and genetic interactions, but now the database 
contains protein-interaction data from yeast, 
worms, flies, plants and even humans along 
with some genetic-interaction data as well. For 
Tyers, the goal is to accurately mirror the pri- 
mary literature and distil it into a format that 
can be used in network biology. “We make no 
judgement calls on the method or even, within 
reason, the quality of the data themselves,” he 
says, giving researchers the opportunity to 


PLAYING BY THE RULES 


When researchers at Plectix 
BioSystems in Somerville, 
Massachusetts, began to use 
their new Cellucidate software to 
model the epidermal growth factor 
receptor pathway, they calculated 
that there were 10® potential 
states — including all protein 
complexes and phosphorylation 
states — for the system. “This 
is the kind of complexity that 
scientists have to grapple with 
when it comes to cell-signalling 
networks,” says Gordon Webster, 
vice-president of biology at Plectix. 

Although not all these potential 
states necessarily occur in 
that pathway, when it comes 
to creating more manageable 
models for understanding cell 
signalling researchers face a 
difficult question: what interaction 
data do they use in their models? 
Although many commercial and 
public databases still rely heavily 
on the small-scale protein-protein- 
interaction studies that appear 
in peer-reviewed literature, the 
emergence of high-throughput 
experimental approaches that 
generate very large interaction 
data sets is creating the need for a 
new set of rules. 

“In practice, what comes out of 
these high-throughput studies 
is nota yes/no thing — ‘these 
interact, and these don't’ — but 
in fact they generate a list of 


interactions and associated 
probabilities,” says Jack 
Greenblatt from the University of 
Toronto in Canada. To generate 
such probabilities for his mass- 
spectrometry studies, Greenblatt 
applied a ‘gold standard’ for 
protein interactions — a set of 
protein complexes or interactions 
in which there is a strong amount 
of confidence according to the 
literature — as well asa set of 
proteins not known to interact 
with one another as a negative 
standard. He then tackled the 
question of whether or not 

data sets generated by mass 
spectrometry stacked up against 
protein-interaction reports seen in 
peer-reviewed literature. 

“What we did in the end was 
to use the same gold standard 
to look at the molecular-biology 
literature,” says Greenblatt. After 
adjusting the cut-off point so that 
the average confidence score 
froma high-throughput study 
matched the confidence score 
of interactions reported in the 
literature, he says the interaction 
data from such studies are no 
better or worse than what is in 
the literature. 

Marc Vidal, a geneticist at the 
Dana-Farber Cancer Institute in 
Boston, Massachusetts, wants 
to see a similar approach taken 
with yeast two-hybrid and other 


Graphical representation of the current budding-yeast interaction network. 


binary screens. “Let's roll up our 
sleeves and decide ona positive 
and negative gold standard,” he 
says. “But let's also use orthogonal 
assays to give confidence scores to 
the interactions.” 

In January, Vidal and his 
colleagues published a series of 
papers®° suggesting the use of 
new binary interaction assays to 
build confidence in basic networks 
produced using yeast two-hybrid 
data sets. “You say ‘OK, this is 
basic network’ and then push 
that into a framework where 
all interactions are going to be 
tested by two or three orthogonal 
assays. And not only that, but do 
that under conditions where you 
have a positive and negative gold 
standard,” says Vidal, adding 
that the high-scoring interactions 
can then serve as hypotheses for 
researchers to test. 
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Whether or not these efforts and 
standards will lead researchers 
to rely more on large-scale data 
sets and mine them more deeply 
will only be known in time. For 
some, even with confidence 
measures, large-scale data sets 
lack information often found 
in smaller studies. “This is one 
of the paradoxes that we find 
when people talk about systems 
biology. With technology it is very 
easy to generate spreadsheets 
of interaction data, but that 
alone does not represent any 
knowledge,” says Webster. 

But for Greenblatt and others, 
large-scale data sets represent 
a starting point for further 
research efforts. “To me, high- 
throughput studies are just like the 
conventional literature,” he says, 
“providing a gold mine for people 
to dig into.” N.B. 
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Cell-signalling software packages allow researchers to model and test cellular interaction networks. 
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extract the maximum amount of information. 
A different angle in modelling signalling 
networks was recently described by Walter 
Fontana from Harvard University and his col- 
leagues’. It uses sets of rules to define relation- 
ships between cellular components instead of 
the more conventional method of defining 
specific interactions and species using differen- 
tial equations. Fontana co-founded a company 
called Plectix BioSystems in Somerville, Mas- 
sachusetts, which has employed this approach 
in a web-based system called Cellucidate. 
“The system is represented at a very granular 
level where the participants are allowed to do 
in silico what they would do in real life,’ says 
Paul Edwards, chief executive at Plectix. Imag- 
ine the city-building computer game SimCity 
reworked for complex cellular networks, but 
here the agents of the cell — proteins and other 
molecules — are the automata instead of col- 
ourful animated people. “In that way the model 
mirrors the behaviour of the living system it 
represents: the biology that emerges from our 


Biomek FX 


models is the combinatorial expression of all 
these automata doing their own little thing — 
just the way itis in the cell,” says Gordon Web- 
ster, vice-president of biology at Plectix. 


Complexity from simplicity 
According to Edwards, the advantage of the 
Cellucidate approach is that a simple set of 
rules for each agent can result in complex bio- 
logical behaviour when agents interact during 
the course of a simulation, unlike modelling in 
other formats, where the complexity has to be 
defined before a simulation can be executed. 
“The level of granularity also means that rules 
and agents can be easily recycled from one 
model to another,’ he notes. Like the GeneGo 
platform and the BioGRID initiative, Plectix 
relies on literature mining from various sets of 
experimental data to create the rules for a model 
system (see ‘Playing by the rules; page 417). 
“Mapping all interactions is important, but 
so is understanding the dynamics behind those 
interactions,’ says Snyder. To understand the 


Mike Tyers uses mass spectrometry to identify protein-protein interactions. 


418 


© 2009 Macmillan Publishers Limited. All rights reserved 


dynamics of the information flow in cells, 
researchers not only need more knowledge of 
protein-protein interaction networks, but they 
also need to understand protein-DNA interac- 
tions, the effects of microRNAs and epigenetic 
changes on gene expression, and how other 
macromolecules such as metabolites affect the 
output of signalling networks. “It is the whole 
system together that determines the final out- 
put and activity,’ says Snyder. 

Vidal thinks that technological improve- 
ments — especially in nanotechnology, to 
generate more data, and microscopy, to explore 
interaction inside cells, along with increased 
computer power — are required to push sys- 
tems biology forward. “Combine all this and 
you can start to think that maybe some of the 
information flow can be captured,” he says. 

But when it comes to figuring out the best 
way to explore information flow in cells, Tyers 
jokes that it is like comparing different degrees 
of infinity. “The interesting point coming out 
of all these studies is how complex these sys- 
tems are — the different feedback loops and 
how they cross-regulate each other and adapt 
to perturbations are only just becoming appar- 
ent, he says. “The simple pathway models 
are a gross oversimplification of what is 
actually happening.” 

Paul Nurse of Rockefeller University in New 
York wrote about understanding the cell’s 
information flow last year’. He noted that “our 
past successes have led us to underestimate the 
complexity of living organisms’, an oversight 
that is rapidly disappearing within the world 
of systems biology and will probably never 
happen again. a 
Nathan Blow is technology editor for Nature 
and Nature Methods. 
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Different approaches for identifying protein-protein interactions often reveal unique information. 


says Bryant, describing GeneGo’s MetaCore 
software. Being able to overlay a variety of dif- 
ferent experimental data from different sources 
requires careful database curation, she says. At 
the moment, GeneGo employs 50 scientists to 
manually mine and curate published literature 
for studies on protein interaction, gene expres- 
sion, metabolism and drugs to expand and 
update its internal database, which now con- 
tains more than 120,000 multi-step interaction 


pathways, each averaging 11 steps, with infor- 
mation on direction, mechanism and feedback 
along the pathways, along with direct links to 
literature evidence. 

Literature mining is important for building 
larger interaction databases, but Bryant says it 
can be especially difficult if the experimental 
descriptions underlying the results have not 
been published. Another problem, according 
to Vidal, is that researchers sometimes have 


“sociological” biases in terms of which proteins 
and interactions they will work on and report. 
“We have learned a lot about the rules of how 
macromolecules interact, but when you ask 
how much of the network we have, or what the 
size of the interactome ofa particular species is, 
if you only used the literature it would be tough 
to answer those questions,” he says. 

Tyers is involved with the publicly funded 
BioGRID (Biological General Reposi- 
tory for Interaction Datasets) initiative, an 
internationally curated database of molecular 
interactions. Three years ago, there was an effort 
to back-curate all the yeast literature for protein 
and genetic interactions, but now the database 
contains protein-interaction data from yeast, 
worms, flies, plants and even humans along 
with some genetic-interaction data as well. For 
Tyers, the goal is to accurately mirror the pri- 
mary literature and distil it into a format that 
can be used in network biology. “We make no 
judgement calls on the method or even, within 
reason, the quality of the data themselves,” he 
says, giving researchers the opportunity to 
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When researchers at Plectix 
BioSystems in Somerville, 
Massachusetts, began to use 
their new Cellucidate software to 
model the epidermal growth factor 
receptor pathway, they calculated 
that there were 10® potential 
states — including all protein 
complexes and phosphorylation 
states — for the system. “This 
is the kind of complexity that 
scientists have to grapple with 
when it comes to cell-signalling 
networks,” says Gordon Webster, 
vice-president of biology at Plectix. 

Although not all these potential 
states necessarily occur in 
that pathway, when it comes 
to creating more manageable 
models for understanding cell 
signalling researchers face a 
difficult question: what interaction 
data do they use in their models? 
Although many commercial and 
public databases still rely heavily 
on the small-scale protein-protein- 
interaction studies that appear 
in peer-reviewed literature, the 
emergence of high-throughput 
experimental approaches that 
generate very large interaction 
data sets is creating the need for a 
new set of rules. 

“In practice, what comes out of 
these high-throughput studies 
is nota yes/no thing — ‘these 
interact, and these don't’ — but 
in fact they generate a list of 


interactions and associated 
probabilities,” says Jack 
Greenblatt from the University of 
Toronto in Canada. To generate 
such probabilities for his mass- 
spectrometry studies, Greenblatt 
applied a ‘gold standard’ for 
protein interactions — a set of 
protein complexes or interactions 
in which there is a strong amount 
of confidence according to the 
literature — as well asa set of 
proteins not known to interact 
with one another as a negative 
standard. He then tackled the 
question of whether or not 

data sets generated by mass 
spectrometry stacked up against 
protein-interaction reports seen in 
peer-reviewed literature. 

“What we did in the end was 
to use the same gold standard 
to look at the molecular-biology 
literature,” says Greenblatt. After 
adjusting the cut-off point so that 
the average confidence score 
froma high-throughput study 
matched the confidence score 
of interactions reported in the 
literature, he says the interaction 
data from such studies are no 
better or worse than what is in 
the literature. 

Marc Vidal, a geneticist at the 
Dana-Farber Cancer Institute in 
Boston, Massachusetts, wants 
to see a similar approach taken 
with yeast two-hybrid and other 
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binary screens. “Let's roll up our 
sleeves and decide ona positive 
and negative gold standard,” he 
says. “But let's also use orthogonal 
assays to give confidence scores to 
the interactions.” 

In January, Vidal and his 
colleagues published a series of 
papers®° suggesting the use of 
new binary interaction assays to 
build confidence in basic networks 
produced using yeast two-hybrid 
data sets. “You say ‘OK, this is 
basic network’ and then push 
that into a framework where 
all interactions are going to be 
tested by two or three orthogonal 
assays. And not only that, but do 
that under conditions where you 
have a positive and negative gold 
standard,” says Vidal, adding 
that the high-scoring interactions 
can then serve as hypotheses for 
researchers to test. 
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Whether or not these efforts and 
standards will lead researchers 
to rely more on large-scale data 
sets and mine them more deeply 
will only be known in time. For 
some, even with confidence 
measures, large-scale data sets 
lack information often found 
in smaller studies. “This is one 
of the paradoxes that we find 
when people talk about systems 
biology. With technology it is very 
easy to generate spreadsheets 
of interaction data, but that 
alone does not represent any 
knowledge,” says Webster. 

But for Greenblatt and others, 
large-scale data sets represent 
a starting point for further 
research efforts. “To me, high- 
throughput studies are just like the 
conventional literature,” he says, 
“providing a gold mine for people 
to dig into.” N.B. 
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COMPANY PRODUCTS/ACTIVITY LOCATION URL 

Systems biology 

Assay Designs Antibodies; ELISA; cell-signalling reagents; pathway maps Ann Arbor, Michigan www.assaydesigns.com 
Aviva Systems Biology Antibodies for the exploration of various biological pathways San Diego, California www.avivasysbio.com 
Cellumen Reagents, assays and analysis tools for exploring models of cellular disease Pittsburgh, Pennsylvania www.cellumen.com 
GeneGo Software for analysis of cell-signalling networks; interaction databases St Joseph, Michigan www.genego.com 
Genstruct Computational approach to understanding systems biology and mechanism-based drug discovery Cambridge, Massachusetts www.genstruct.com 
Optimata Computational tools for patient physiology and pathology; dynamics of drug interactions Ramat Gan, Israel www.optimata.com 
Physiomics Systems-biology approach to development of software and models of cancer Oxford, UK www.physiomics-plc.com 


Plectix BioSystems 


Software called Cellucidate for the analysis of cell-signalling pathways 


Somerville, Massachusetts 


www.plectix.com 


RayBiotech Protein and antibody arrays; ELISA; tissue microarrays Norcross, Georgia www.raybiotech.com 

R&D Systems Protein arrays, along with other molecular- and cell-biology reagents and kits Minneapolis, Minnesota www.rndsystems.com e 
Chromatography 

Advanced Separation Analytical and preparative chromatography products; HPLC and chiral chromatography Bellefonte, Pennsylvania www.astecusa.com 


Technologies 
Alcott Chromatography 


Biotage 

Capital HPLC 

Cecil Instruments 
DataApex 

GE Healthcare 
GeneTel 

Gilson 

Hamilton Company 
Phenomenex 


Tosoh Bioscience 


Chromatography software 


HPLC and CE columns and accessories 


HPLC equipment and software, ion chromatography, hydrochromatic chromatographs 


Chromatography columns and accessories; FLASH chromatography systems 


Modular HPLC instruments, ion chromatography, UV/visible spectrophotometers 


AKTAExpress automated liquid chromatography platform 


Antibodies for proteomics; antibody services 


Solid-phase extraction systems, liquid chromatography 


Automated solid-phase extraction systems 


Chromatography columns and accessories 


Liquid chromatography columns, products and accessories 


Norcross, Georgia 
Uppsala, Sweden 

West Lothian, UK 
Cambridge, UK 

Prague, Czech Republic 
Little Chalfont, UK 
Madison, Wisconsin 


iddleton, Wisconsin 


Reno, Nevada 
Torrence, California 


Stuttgart, Germany 


www.alcottchromatography.com 
www.biotage.com 
www.capital-hplc.co.uk 
www.cecilinstruments.com 
www.dataapex.com 
www.gehealthcare.com 
www.genetel-lab.com 
www.gilson.com 
www.hamiltoncompany.com 
www.phenomenex.com 


www.tosohbioscience.com 


Mass spectrometers 


Agilent Technologies 
Applied Biosystems 
Bruker Daltonics 

Cecil Instruments 

Hitachi High Technologies 
Leco 

MDS Sciex 

Shimadzu 

Thermo Fisher Scientific 


Varian 


Waters 


crystallography 


ass-spectrometry instruments, sample preparation and software 

ass-spectrometry instruments, reagents and software; ion sources 

ass spectrometers for different applications 

Spectroscopy and chromatography instruments and systems 

Liquid-chromatography systems; LC/MS systems 

ass-spectrometry instrumentation and analysis tools 

ass-spectrometry systems and technology; HPLC systems 

Laboratory instruments including mass spectrometers and data-management systems 
High-end analytical instruments, including mass spectrometers, as well as laboratory equipment, 
software, services, consumables and reagents 
Range of analytical instruments including nuclear magnetic-resonance and magnetic-resonance 


imaging systems, mass-spectrometry instruments, Fourier-transform infrared and X-ray 


Liquid-chromatography systems; chromatography columns and chemistry products; mass- 


spectrometry systems; laboratory informatics solutions 


Santa Clara, California 
Foster City, California 
Billerica, Massachusetts and 
Bremen, Germany 
Cambridge, UK 

Tokyo, Japan 

St Joseph, Michigan 
Sunnyvale, California 

Kyoto, Japan 


Waltham, Massachusetts 


Palo Alto, California 


Milford, Massachusetts 
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www.agilent.com 
www.appliedbiosystems.com 
www.bdal.de 
www.cecilinstruments.com 

www. hitachi-hitec.com 
www.leco.com 

www.mdssciex.com 
www.shimadzu.com 
www.thermo.com e 


www.varianinc.com 


www.waters.com 
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COMPANY PRODUCTS/ACTIVITY LOCATION URL 
General 
Active Motif Tools for studying epigenetic regulation; antibodies; transfection reagents; bioinformatics tools Carlsbad, California www.activemotif.com 


Alexis Biochemicals 
Attagene 

BD Biosciences 
BioGenes 

Biomol 

Bio-Rad 

BMG Labtech 
Cole-Parmer 
CombiMatrix 

EMD Biosciences 
Enzo Life Sciences 
Geneservice 
Genovac 

Hamilton Robotics 
Harvard Apparatus 
Horiba Scientific 
Integra Biosciences 
Invitrogen 

Irvine Scientific 
Lonza 

Merck 

Metrohm 
Molecular Devices 
MP Biomedicals 
Nalgene Nunc 

New England Biolabs 
Oligoengine 

PARC 

PerkinElmer 

Pierce Protein 
Premier Biosoft 
Princeton Separations 
Promega 
Stratagene 

Takara Bio 

USB 


Wako USA 


@ see advertisement 
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Reagents for molecular- and cell-biology research 

Transcription-factor profiling system; software 

Research reagents, bioimaging systems, instruments 

Antibody and immunoassay services 

Services for chemical synthesis, cell culture and antibody production 

Products, instruments and software for life-sciences research 

Microplate and array readers and handling systems 

Instruments and reagents 

Influenza detection systems and microarrays 

Calbiochem, Novabiochem and Novagen product lines 

Consumables and assays for molecular biology, gene expression and genomic analysis 
Genomic and proteomic resources; contract services for DNA sequencing, microarray 


analysis and SNP genotyping 
Gene synthesis, plasmid purification services; antibody purification and processing 


Automated liquid-handling equipment, automated robotic platforms 


Instruments and equipment for electrophysiology and cell biology 


Spectroscopy systems and accessories including Raman, atomic emission and UV spectroscopy 


Equipment for sterilization, liquid handling, cell culture and sample storage 


Kits and reagents for molecular-biology, genomics, and cell-biology research 

Defined media for cell-culture applications; custom media services 

Molecular-biology reagents and systems; advanced chemical synthesis 

Chemicals, kits and reagents for chemistry, molecular- and cell-biology-related research 
Laboratory instrument supplier; consumables 

Liquid-handling and microplate-processing equipment; imaging instruments 

Reagents and chemicals for research 

Labware 

Molecular-biology-related reagents, kits and enzymes 

RNAi-synthesis design software; RNAi vectors and clones 


Software for the analysis of glycan mass-spectrometry data sets 


nstruments, reagents and kits for life sciences 


Protein assays, purification, Western blotting 


Software for life-sciences research; SimGlycan analysis tool 
DNA-purification columns and reagents, fluorescent protein-labelling kits, 
Chemicals for mass-spectrometry analysis 

Tools and reagents for molecular biology, genomics and proteomics 
Reagents, kits and consumables for molecular biology 

Chemicals and reagents for molecular biology 


Speciality chemicals supplier; clinical diagnostic reagents 


Lausanne, Switzerland 
Research Triangle Park, North 
Carolina 

San Diego, California 

Berlin, Germany 

Hamburg, Germany 
Hercules, California 
Offenburg, Germany 


Vernon Hills, Illinois 


Mukiteo, Washington 


San Diego, California 
New York, New York 
Cambridge, UK 

Freiburg, Germany 

Reno, Nevada 

Holliston, Massachusetts 
Edison, New Jersey 

Baar, Switzerland 
Carlsbad, California 
Santa Ana, California 
Basel, Switzerland 
Darmstadt, Germany 
Westbury, New York 
Sunnyvale, California 
Aurora, Ohio 

Rochester, New York 
pswich, Massachusetts 
Seattle, Washington 

Palo Alto, California 
Waltham, Massachusetts 


Rockford, Illinois 


Palo Alto, California 


Adelphia, New Jersey 


Madison, Wisconsin 
La Jolla, California 
Shiga, Japan 
Cleveland, Ohio 


Richmond, Virginia 
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www.alexis-corp.com 
www.attagene.com 
www.bd.com 
www.biogenes.de 
www.biomol.de 
www.bio-rad.com 
www.bmglabtech.com 
www.coleparmer.com 
www.combimatrix.com 
www.emdbiosciences.com 
www.enzo.com 
www.geneservice.co.uk 
www.genovac.com 
www.hamiltonrobotics.com 
www.harvardapparatus.com 
www.horiba.com 
www.integra-biosciences.com 
www.invitrogen.com 
www.irvinesci.com 
www.lonza.com 
www.merck.de 
www.metrohmusa.com 
www.moleculardevices.com 
www.mpbio.com 
www.nalgenunc.com 
www.neb.com 
www.oligoengine.com 
www.parc.com 
las.perkinelmer.com 
www.piercenet.com 
www.premierbiosoft.com 
www.prinsep.com 
www.promega.com 
www.stratagene.com 
www.takara-bio.com 
www.usbweb.com 


www.wakousa.com 
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How did you become 
interested in taxonomy? 

I was always curious about 
living things. During my 
senior year of college, I 
showed my entomology 
professor specimens of 
sweat bees nesting in my 
driveway. Their biology was 
unknown and my professor 
encouraged me to document 
it as an undergraduate thesis 
project, which evolved into a 
PhD dissertation. 


Were you surprised by the 
appointment to the ICZN? 
Yes. ’'m an opinionated 
individual, and my opinions 
don’t go down too well 

with large segments of the 
taxonomic community. 

I think the present system 
by which we name species is 
not policed effectively and has 
loopholes and ambiguities. 
For example, scientific 
names can be published in 
journals without peer review. 
Although that freedom is 
fine, the reality effectively 
permits taxonomic vandals 
to plagiarize others or publish 
without scientific merit. 

The ICZN, a 28-member 
commission, is now trying 
to deal with this and other 


messy issues. I advocate 
substantial changes to the 
code, such as provisions for 
online-only publication, 
but others prefer the status 
quo, which makes for a 
contentious debate. 


Have you had a career- 
defining moment? 

I can’t pick one moment. 
The thrill of making new 
discoveries drives my 
work. When you work with 
insects, virtually everything 
you do is a discovery 
because so few species have 
been studied in depth. 

I have personally 
discovered at least 200 
species, which have all been 
gratifying. In my current 
position I oversee one of 
the largest collections of 
insects in North America at 
the Entomology Research 
Museum at the University of 
California, Riverside. 


What has been your 
biggest career challenge? 
Entomology as a discipline 
doesnt get much respect 
or support. Unfortunately, 
no one seems to care about 
the estimated 10 million 
undocumented life forms 


Taxonomist Douglas Yanega has been 
appointed to the International Commission on 
Zoological Nomenclature (ICZN) in London. 


on this planet. However, 

as a career choice, it is 
phenomenal. I help bring 
attention to these species. 
Unfortunately the current 
California budget turmoil 
has put my career at risk 
because I’m considered a 
staff member rather than a 
faculty member. 


Do you think your ICZN 
appointment will help your 
career survive lay-offs? 

As it is not a paid position, 
serving as a commissioner 
on the ICZN is ultimately a 
matter of prestige. But my 
standing at the university 
depends entirely on the 
interest of administrators 
overseeing my position. So I 
hope they will be sympathetic 
when assessing the nature 
and impact of my work. But 
I worry that entomology 
doesn’t garner much respect. 


What do you value most 
about the scientific 
process? 

That in the long run, reason 
does win out. As a reasonable 
person, you like to see the 
proper outcome. a 


Interview by Virginia Gewin 


POSTDOC JOURNAL 


Several years ago, while still a 
student at Cornell University 
in Ithaca, New York, | tolda 
friend that | would be spending 
six months at the University 

of Calgary in Alberta. He 

joked that | was a “mercenary 
for science”. That joke has 
become a depressing reality. 

In the past six years, I've 
lived in five locations across 
the United States and Canada. 
| feel as though | lack roots. | 
am constantly moving, living 
out of boxes, leaving friends 
and adjusting to a new town. 

It is a solitary, nomadic life. 

In those six years | have 

accrued an enviable hoard 


Wealth of knowledge? 


of scientific wealth. While 

a graduate student enrolled 
at Cornell ina department 
famous for nonlinear 
dynamics, | plundered the 
University of California, 
Berkeley, for knowledge of 
metabolic biochemistry and 
the University of Calgary 
for knowledge of muscle 
mechanics. As a postdoc, 

| raided the University of 
Vermont in Burlington, 
escaping with invaluable 
experience in experimental 
single-molecule biophysics. 
Now at Johns Hopkins 
University in Baltimore, 
Maryland, | am greedily 


absorbing the very latest in 
theoretical biophysics. But 
even with all this wealth, | just 
want a place to call home. 

In afew months, | begin 
the next round of faculty 
applications, and | think | 
need to change my mindset. 
Maybe | should stop thinking 
about how to maximize my 
knowledge and instead start 
thinking about where | want to 
live. | just hope I've amassed 
enough scientific wealth to 
afford the rent. | 
Sam Walcott is a postdoc 
in theoretical biophysics at 
Johns Hopkins University in 
Baltimore, Maryland. 
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IN BRIEF 


Energy bill to create jobs? 


Calls to create clean-energy jobs were 

a common refrain at a 7 July US Senate 
hearing for a bill pending in Congress. 
The Waxman-Markey bill aims to 
create jobs, reduce global warming 

and encourage the use and creation of 
alternative sources of energy. The House 
of Representatives passed the bill on 

26 June and the Senate is likely to 
introduce its version this autumn. 

Senator Benjamin Cardin (Democrat, 
Maryland) argued that the priority should 
be to foster green jobs. “This bill is about 
keeping jobs and expanding jobs,’ he said. 
Senator Kit Bond (Republican, Missouri) 
countered that the bill would “impose 
new energy taxes’, reduce jobs and create 
a “bureaucratic nightmare to implement a 
carbon cap-and-trade programme”. 


Californian budget woes 


More than 300 scientists from the 
University of California system are 
protesting proposed state budget cuts 
that they warn will endanger California's 
science and technology enterprise. 

In a6 July letter to Republican 
governor Arnold Schwarzenegger, the 
scientists caution that the cuts “are 
likely to destroy” the university’s status 
as “the leading public university in the 
United States”. The cuts would lead to 
salary reductions, resignations and 
would jeopardize the university's ability 
to attract new faculty members, the 
document says. 

A spokeswoman for the governor says 
the state’s US$26-billion budget deficit 
has created a “worst-case scenario” in 
which the cuts are unavoidable. 


Physicists wanted 


A boost in funding and a shift in focus to 
photon science have prompted expansion 
at the SLAC National Accelerator 
Laboratory in Menlo Park, California. 
Lisa Mongetta, the lab’s human- 
resources manager, says the lab is 
seeking more than half-a-dozen 
physicists, research scientists and 
engineers, although she warned that the 
number could change rapidly. In recent 
weeks SLAC has filled dozens of other 
positions. The expansion is a reversal 
of fortunes for the lab, which in January 
2008 announced it would shut down one 
of its colliders as a result of budget cuts 
(see Nature 451, 235; 2008). 
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Down but not out 


Indian outsourcing has been hit badly by the global downturn, but signs of growth remain. 


K. S. Jayaraman investigates. 


n economic tsunami is assailing 
the Indian shores. The country 
has been hit by the global crisis, 
financial scandal in the nation, 
terror attacks in Mumbai and US President 
Barack Obama's campaign promise to end tax 
breaks for US companies that send their work 
overseas. The result: the outsourcing industry, 
a source of Indian economic prosperity 
and national pride since the mid-1990s, has 
been hit hard. Clinical research, aerospace, 
information technology and ‘business process 
outsourcing’ (tasks such as human resources, 
customer service and accounting duties) 
have all been affected. But they all have the 
promise to weather the storm. 

Bombings in Mumbai in November 2008 
drove away potential investors, and the 
uncovering of a 71-billion-rupee (US$1.5- 
billion) fraud operation at Hyderabad-based 
Satyam Computers, India’s fourth-biggest 
software and service exporter, raised severe 
credibility issues. Satyam’s chairman, 
Ramalinga Raju, who is now in jail for his 
part in the offence, confessed to having 
falsified Satyam’s accounts for years to make 
the company’s performance seem better. 
Meanwhile, tax disincentives introduced 
by Obama in May mean trouble at Indian 
outsourcing hot spots, including Gurgaon, 
Mumbaiand Pune in the north and 
Bangalore, Hyderabad and Chennai in the 
south. 

The information technology (IT) and 
business-process outsourcing 
sector may be the biggest 
victim. On 2 June, nearly 
15,000 workers from this 
sector sent a petition to their 
trade body, the National 
Association of Software 
and Service Companies 
(NASSCOM), urging 
employers to explore creative 
cost-cutting options rather 


“India's main 
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A massive case of fraud at Satyam Computers has hit the information-technology industry hard. 


Services, based in Mumbai, and Wipro 
Technologies in Bangalore, stopped hiring 
this year. Wipro chairman Azim Premji told 
an industry meeting in Bangalore on 13 
June that Indian companies may not be able 
to hire more than one-third of the 35,000 
engineers expected to graduate in India next 
year. Shekhar predicts that at least 50,000 

.. people in the sector will lose 
their jobs by the end of the 
year. “Instead of firing them, 
the companies should try to 
enhance their skills to enable 
them to shoulder new tasks,” 
says Shekhar. 

Yet some see signs of 
hope. NASSCOM chairman 
Pramod Bhasin told the 
June NASSCOM business- 


than laying people off. attraction for the process outsourcing strategy 
Between January 2008 ; summit in Bangalore that 

and February 2009 — after drug industry — the industry will start to 

the fraud scandal broke — its huge diseased bounce back in early 2010, 


Satyam’s workforce shrank 
by 9,400 people to 41,600, 
according to Karthik 
Shekhar, secretary-general 
of the Union for IT-Enabled 
Services Professionals. The 
Delhi and Bangalore offices of IT firm HCL 
Technologies laid off 450 of their nearly 
53,000 employees, Shekhar says. Two other 
outsourcing companies, Tata Consultancy 
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population — will 
not go away.” 
— Vasudeo Ginde 


thanks to a huge domestic 
market, government-related 
outsourcing and expanded 
international reach. Bhasin 
suggests that companies 
explore new sectors and 
regions, focus more on the domestic market 
and rely less on the United States and 
Europe. Despite the slowdown, the industry 
will continue to hire more people than it 
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loses this year; it will have grown by 15% 
rather than the 30% typical of earlier years, 
says Bhasin. 


Seeking alternatives 
The sagging economy has encouraged 
young scientists and engineers to consider 
safer options, such as government jobs. The 
Indian Space Research Organisation (ISRO), 
based in Bangalore, received more than 
130,000 applications from scientists and 
engineers this year for just 300 advertised 
posts, according to ISRO spokesman 
S. Satish. And positions at the Defence 
Research & Development Organisation are 
in high demand. The organization may hire 
as many as 60 engineering graduates — 12% 
of the people it recruits this year — from the 
prestigious Indian Institutes of Technology 
compared with one or two in a typical year, 
according to its chief controller of research 
and development, Prahlada. Traditionally, 
graduates from the institutions travel abroad 
or opt for the high salaries of big companies. 
But outsourcing companies such as Ananth 
Technologies in Hyderabad are trying to turn 
the adversity into an opportunity. Ananth 
applies geographic information system (GIS) 
tools to spatial data from satellites — whether 
providing data to telephone companies, 
about the best locations for their transmitters 
or to oil companies about the best route for 
laying pipelines. 


N. SEELAM/AFP/GETTY IMAGES 


ANANTH TECHNOLOGIES LTD. 
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“India is a huge marketplace for GIS, 
and it would be quite easy for the industry 
to absorb all its GIS professionals,” says 
Subba Rao Pavuluri, founder and managing 
director of Ananth. His firm also provides 
engineering design services for construction 
companies in the United States. “This is a 
software-driven service but the job requires 
structural and mechanical engineers,” he 
says. The company, which had just 40 people 
on its staff three years ago, now employs 
1,100 geologists, geomorphologists and 
engineers, among others. The firm also 
provides services to the aerospace industry 
and expects to benefit from a policy 
established in 2006 stating that foreign 
companies selling defence or space systems 
such as aircraft or satellites to India must 
buy components and software from the 
nation worth 30% of the contract. “This is 
going to be a big boost to service providers 
like us,’ says Pavuluri. Many aviation firms 
have plans to expand their operations in 
India, which means great opportunity for 
designing and engineering outsourcing. 

Even as they trim costs at home, some US 
companies continue to invest in India. In 
May, the industrial supply firm Honeywell 
International in Morristown, New Jersey, 
which employs 6,000 engineers at its 
Bangalore research-and-development site, 
opened a second, $50-million, 
facility in Bangalore that 
will employ 3,000 people in 
sectors such as aerospace 
and defence. Still, the future 
of Indian outsourcing could 
be with nations such as 
Japan. In a report entitled 
Global 1000 Re»D Spenders: 
India Opportunity Identified, 
Zinnov, a management- 
consulting company based 
in Bangalore says that Japan 
will potentially send a lot of 
its research and development 
in areas such as electronic 
equipment and automotive 
research to India. 


Clinical research 

Contract manufacturing and clinical- 
services research have become mainstays 
in India. But the economic downturn 

has rattled this sector, too, prompting 
professionals to discuss the sector’s status at 
the Pharmaceutical Outsourcing Congress 
in Mumbai on 28-30 July. Nearly 80 India- 
based contract research organizations 
employ about 22,000 workers, according 
to Venkat Jasti, an executive committee 
member of the Association of Contract 
Research Organizations. 

A. V. Rama Rao, the founder, chairman 
and managing director of Avra Laboratories, 
a contract research and manufacturing 
services organization in Hyderabad, says 


“The space- 
components policy 
is going to bea big 
boost to service 
providers like us.” — 
Subba Rao Pavuluri 


Indian radiologists such as Arjun Kalyanpur will interpret medical images sent in from abroad. 


that many other such organizations have 
had to cease operations owing to a lack of 
contracts. “Two years ago organic chemists 
were in great demand, he says. “Today 
there are no jobs.” Hyderabad-based GVK 
Biosciences reports that it is not hiring at its 
usual pace. The company currently employs 
about 800 chemists (150 of whom have 
PhDs), some 300 informatics specialists and 
200 clinical pharmacists. “Contract research 
organizations that depend on a single 

client or a single project are the ones most 
affected,” says Jang Bahadur 
Gupta, vice-president for 
collaborative research at 
GVK. 

Some such organizations, 
however, continue to thrive. 
Vasudeo Ginde, founding 
director of DiagnoSearch 
Life Sciences in Mumbai, 
says that his company has 
hired 30% more people this 
year, to make a total of more 
than 100. Ginde thinks that 
turnaround is inevitable. 
“India’s main attraction for 
the drug industry — its huge 
diseased population — will 
not go away,’ he says. Nor, 
presumably, will the sizable cost savings for 
offshore companies. Vis Niranjan, president 
of RxMD, a leading clinical research and 
development outsourcing company in 
Chennai, says that the recession in the 
United States has helped him hire highly 
skilled physician-scientists of Indian origin 
from the United States. “I am unable to 
employ physicians coming out of Indian 
medical schools,” he adds. “They may 
provide good patient care but not the good 
science that our industry needs” 

For global contract research organizations 
such as Quintiles Transnational, meanwhile, 
it’s business as usual. “We expect to continue 
to add jobs because we are experiencing 
double-digit growth in India,” says the firm’s 
spokeswoman Mahrukh Engineer. Quintiles, 
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which employs 23,000 people all over the 
world, has more than 1,300 employees at 
four sites in India — Ahmedabad, Bangalore, 
Mumbai and New Delhi — and will opena 
fifth in Hyderabad next year. According to 
Engineer, the firm has not had any contracts 
cancelled or significant delays in starting 

up clinical studies, a challenge that other 
contract-research organizations have faced 
this year. 

Some medical-outsourcing niches have 
seen little fall-off. Radiologists at several 
companies have started reading computed- 
tomography scans, magnetic-resonance 
images and other medical images sent by 
hospitals abroad. A team of radiologists at 
Teleradiology Solutions in Bangalore, for 
instance, provides radiology services for 
more than 70 hospitals in the United States 
and 10 hospitals in Singapore. The firm also 
plans to connect to hospitals in Africa. “We 
report nearly 1,000 scans a day and plan for 
doubling the number,’ says Arjun Kalyanpur, 
a radiologist trained in the United States, 
who, with his paediatrician wife, founded the 
company 7 years ago. 

More clinical trials in stem-cell therapy 
could be outsourced after the Indian 
government proposed new guidelines for 
researchers and clinicians. Soon, Parliament 
is expected to adopt the proposed legislation. 
Once companies have registered their stem- 
cell trials, researchers would be permitted to 
run them anywhere in India, irrespective of 
where the stem cells had been obtained from. 
So far, the government has cleared only two 
clinical trials, both by the firm Stempeutics 
in Bangalore. Its president, Balu Manohar, 
says that this may open up clinical-trial 
collaborations with foreign researchers. 

Such trends suggest that outsourcing 
in India’s medical sector still holds ample 
potential for growth. “We will catch up in no 
time,” says Jasti. “That is what our numbers 
say. IT professionals, meanwhile, may have a 
bigger, more challenging recovery ahead. ™ 
K. S. Jayaraman writes for Nature from 
Bangalore. 
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Makeover 


Off on the wrong track. 


James L. Cambias 


I stopped by Hub Hobbies down on 


Broad Street, and literally bumped KAN R 


into Mike LaSalle on the way in. He K 
put out an arm to keep me from fall- 
ing over, then broke into a grin. 

“Stephen! How’ve you been?” 

“Oh, fine, fine. I’m here to 
drool over some of the rolling 
stock. Did you see that really sweet 
brass Allegheny they just got in?” 

“Nope. I’m selling today, not buy- 
ing.” - 
“Really? What are you getting = * 
rid of?” Mike had some amazing 
stuff — being both a lawyer and single 
he could buy pretty much any 
model he wanted without hav- 
ing to live on ramen and tuna, 
the way I did. 

“The whole thing. I just 
worked out a consignment deal 
with the store.” 

“Everything? Your whole layout? How 
come? Are you switching to N-scale?” 

“Tm cured. Giving it all up. I’m selling 
the house and getting a condo in the Ware- 
house District.” 

“What? How? Why?” 

It was starting to rain so we moved inside 
and stood between the theft detector posts 
while he told me. 

“T signed up with MatchScan. Are you 
on that?” 

“No, I met Stevie before that got big. I 
never used dating services anyway.” 

“It’s more than a dating service. They 
actually read your brain. You put your 
head in this gadget called an fMRI, and 
then they show you a bunch of images 
and video clips. It reads how your brain 
reacts and builds up a personality profile. 
Tells what kind of person youre attracted 
to. Then they just match up people with 
compatible scans. Lots better than filling 
out questionnaires.” 

“No lying” 

“Exactly. They ve scanned thousands of 
couples to find out what personality types 
match up. I know some people who got 
together through MatchScan — they said 
it was love at first sight” 

“But what does that have to do with you 
selling your trains? Is it expensive?” 

He waved his hand dismissively. “I can 
afford it. No, the trains were the problem. 
The MatchScan people said it made me an 
‘outlier. Any woman I find attractive would 
hate being with a model railroader” 


There are times when I'm really glad ’m 
gay. Stevie and I both like trains. He keeps 
the locomotives running and handles the 
electronics, so I can spend my time paint- 
ing rolling stock and building terrain. A 
match made in Man Heaven. 

Something puzzled me. “Wait a second 
— getting rid of the trains won't help. It’s 
all based on that brain scan, right? Even 
if you don't have a single inch of track in 
your house that MRI will still tell you like 
them.” 

“That’s right.” Mike looked smug. 
“That’s why I signed up for the advanced 
programme.” 

“Which is?” 

“It's a step beyond profiling. They actu- 
ally modify your brain to make your profile 
more appealing” 

“What, they put you in a chair and show 
you newsreels with your eyelids clamped 
open?” 

Mike's not a movie buff so he looked 
puzzled. “No, it’s not like that. They use 
NMT. Neural Marker Therapy. Doctors 
use it to treat people with addictions” 

I dimly remembered Stevie talking about 
it. They give the patient a marker protein 
which binds to brain cells, then they expose 
him to material linked to the undesirable 
trait — alcohol, nicotine, child porn, what- 
ever. I guess for Mike it was playing with 
trains. A couple of hours later the patient 
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takes an antagonist drug, and then 
gets exposed to a bunch of unre- 
lated stimuli. Once it’s all done, only 
the part(s) of the brain concerned 
with the undesirable habit are marked 


patient a drug that binds to the marker 
protein and suppresses neurotransmit- 
ter function and blood flow. I expect it’s 
more complicated than it sounds. 

Not to mention kind of creepy. “Don't 
they use that in places like Syria to — “ 
“Yeah, and Burma, too. Probably a lot 
of countries. The FDA wont approve it 
for non-medical purposes here. Match- 

Scan has a referral deal with 
®° a clinic in the Bahamas. It’s a 

nice place; there’ a little beach 
resort you can stay at while youre 
being treated” 

“Did it work?” 
“Absolutely. All this,” he ges- 
tured at the shelves of model rail 
cars, miniature building kits, track, 
tools, paint, power supplies, reference 
books, bags of green sawdust, balsa wood, 
“doesn't interest me at all any more. It’s like 
being in a vegetarian restaurant. Or a jew- 
ellery store. Some of it’s kind of neat, but I 
just don't care” 

“Jesus, Mike. You used to love running 
trains. You said once youd been in love 
with model trains since you were four 
years old. Is it worth it?” 

“Yes, he said, and I was surprised at how 
intense he sounded. “Yes, it really is worth 
it. ’'m tired of being alone. Of six-month 
relationships. Of seeing really nice girls 
going around with these complete jerks 
and wondering why they didn't like me. 
Now I’ve gota list of potential matches two 
pages long. It is absolutely worth it?” 

There's not much one can say ina situa- 
tion like that. I wished him luck and turned 
to go pick through the boxes of train loot 
he'd left behind. He went out and almost 
bumped into another customer coming 
in. She stepped back to let him pass. He 
nodded absently at her, then headed for 
his car. 

She looked a couple of years younger 
than Mike, alittle chunky but with an intel- 
ligent smile. Her eyes lit up when she saw 
the gear hed left on the counter. a 
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